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Description 
TECHNICAL FIELD 

[0001] This invention is in the field of protein expression. In particular, it relates to the heterologous expression of 
proteins from Neisseria (e.g. N. gonorrhoeae or, preferably, N. meningitidis). 

BACKGROUND ART 

[0002] I nternational patent applications W099/24578, W099/36544, WO99/57280 and WO00/22430 disclose proteins 
from Neisseria meningitidis and Neisseria gonorrhoeae. These proteins are typically described as being expressed in 
E.coii (i. e. heterologous expression) as either N-terminal GST-fusions or C-terminal His-tag fusions, although other 
expression systems, including expression in native Neisseria, are also disclosed. 

[0003] It is an object of the present invention to provide alternative and improved approaches for the heterologous 
expression of these proteins. These approaches will typically affect the level of expression, the ease of purification, the 
cellular localisation of expression, and/or the immunological properties of the expressed protein. 

DISCLOSURE OF THE INVENTION 

Nomenclature herein 

[0004] The 21 66 protein sequences disclosed in W099/24578, W099/36544 and WO99/57280 are referred to herein 
by the following SEQ# numbers: 



Application 


Protein sequences 


SEQ# herein 


W099/24578 


Even SEQ IDs 2-892 


SEQ#s 1-446 


W099/36544 


Even SEQ IDs 2-90 


SEQ#s 447-491 


WO99/57280 


Even SEQ IDs 2-3020 
Even SEQ IDs 3040-3114 
SEQ IDs 31 15-3241 


SEQ#s 492-2001 
SEQ#s 2002-2039 
SEQ#s 2040-2166 



[0005] In addition to this SEQ# numbering, the naming conventions used in W099/24578, W099/36544 and 
WO99/57280 are also used (e.g. 'ORF4', 'ORF40', 'ORF40-1' etc. as used in W099/24578 and W099/36544; 'm919', 
'g919' and 'a919' etc. as used in WO99/57280). 

[0006] The 2160 proteins NMB0001 to NMB2160 from Tettelin et al. [Science (2000) 287:1809-1815] are referred to 
herein as SEQ#s 2167-4326 [see also WO00/66791]. 

[0007] The term 'protein of the invention' as used herein refers to a protein comprising: 

(a) one of sequences SEQ#s 1-4326; or 

(b) a sequence having sequence identity to one of SEQ#s 1-4326; or 

(c) a fragment of one of SEQ#s 1-4326. 

[0008] The degree of 'sequence identity' referred to in (b) is preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 
95%, 99% or more). This includes mutants and allelic variants [e.g. see WO 00/6 6741]. Identity is preferably determined 
by the Smith-Waterman homology search algorithm as implemented in the MPSRCH program (Oxford Molecular), using 
an affine gap search with parameters gap open penalty=12 and gap extension penalty=1. Typically, 50% identity or 
more between two proteins is considered to be an indication of functional equivalence. 

[0009] The 'fragment' referred to in (c) should comprise at least n consecutive amino acids from one of SEQ#s 1 -4326 
and, depending on the particular sequence, n is 7 or more (eg. 8, 1 0, 1 2, 1 4, 1 6, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 
90, 100 or more). Preferably the fragment comprises an epitope from one of SEQ#s 1-4326. Preferred fragments are 
those disclosed in WO00/71574 and WO01/04316. 

[0010] Preferred proteins of the invention are found in N. meningitidis serogroup B. 

[0011] Preferred proteins for use according to the invention are those of serogroup B N. meningitidis strain 2996 or 
strain 394/98 (a New Zealand strain). Unless otherwise stated, proteins mentioned herein are from N. meningitidis strain 
2996. It will be appreciated, however, that the invention is not in general limited by strain. References to a particular 
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protein (e.g. '287', '919' etc.) may be taken to include that protein from any strain. 
Non-fusion expression 

5 [0012] In a first approach to heterologous expression, no fusion partner is used, and the native leader peptide (if 
present) is used. This will typically prevent any 'interference' from fusion partners and may alter cellular localisation 
and/or post-translational modification and/or folding in the heterologous host. 

[0013] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
(a) no fusion partner is used, and (b) the protein's native leader peptide (if present) is used. 
10 [0014] The method will typically involve the step of preparing an vector for expressing a protein of the invention, such 
that the first expressed amino acid is the first amino acid (methionine) of said protein, and last expressed amino acid is 
the last amino acid of said protein (i.e. the codon preceding the native STOP codon). 

[0015] This approach is preferably used for the expression of the following proteins using the native leader peptide: 
1 1 1 , 1 49, 206, 225-1 , 235, 247-1 , 274, 283, 286, 292, 401 , 406, 502-1 , 503, 51 9-1 , 525-1 , 552, 556, 557, 570, 576-1 , 

15 580, 583, 664, 759, 907, 913, 920-1 , 936-1 , 953, 961 , 983, 989, Orf4, Orf7-1 , Orf9-1 , Orf23, Orf25, Orf37, Orf38, Orf40, 
Orf40.1 , Orf40.2, Orf72-1 , Orf76-1 , Orf85-2, Orf91 , Orf97-1 , Orf1 19, Orf143.1 , NMB0109 and NMB2050. The suffix 'L' 
used herein in the name of a protein indicates expression in this manner using the native leader peptide. 
[0016] Proteins which are preferably expressed using this approach using no fusion partner and which have no native 
leader peptide include: 008, 105, 117-1, 121-1, 122-1, 128-1, 148, 216, 243, 308,593, 652, 726, 926, 982, Orf83-1 and 

20 Orf 143-1. 

[0017] Advantageously, it is used for the expression of ORF25 or ORF40, resulting in a protein which induces better 

anti-bactericidal antibodies than GST- or His-fusions. 

[0018] This approach is particularly suited for expressing lipoproteins. 

25 Leader-peptide substitution 

[0019] In a second approach to heterologous expression, the native leader peptide of a protein of the invention is 
replaced by that of a different protein. In addition, it is preferred that no fusion partner is used. Whilst using a protein's 
own leader peptide in heterologous hosts can often localise the protein to its 'natural' cellular location, in some cases 
30 the leader sequence is not efficiently recognised by the heterologous host. In such cases, a leader peptide known to 
drive protein targeting efficiently can be used instead. 

[0020] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
(a) the protein's leader peptide is replaced by the leader peptide from a different protein and, optionally, (b) no fusion 
partner is used. 

35 [0021] The method will typically involve the steps of: obtaining nucleic acid encoding a protein of the invention; ma- 
nipulating said nucleic acid to remove nucleotides that encode the protein's leader peptide and to introduce nucleotides 
that encode a different protein's leader peptide. The resulting nucleic acid may be inserted into an expression vector, 
or may already be part of an expression vector. The expressed protein will consist of the replacement leader peptide at 
the N-terminus, followed by the protein of the invention minus its leader peptide. 

40 [0022] The leader peptide is preferably from another protein of the invention (e.g. one of SEQ#s 1-4326), but may 
also be from an E.coli protein (e.g. the OmpA leader peptide) or an Erwinia carotovora protein (e.g. the PelB leader 
peptide), for instance. 

[0023] A particularly useful replacement leader peptide is that of ORF4. This leader is able to direct lipidation in E.coli, 
improving cellular localisation, and is particularly useful for the expression of proteins 287, 919 and AG287. The leader 

45 peptide and N-terminal domains of 961 are also particularly useful. 

[0024] Another useful replacement leader peptide is that of E.coli OmpA. This leader is able to direct membrane 
localisation of E.coli. It is particularly advantageous for the expression of ORF1, resulting in a protein which induces 
better anti-bactericidal antibodies than both fusions and protein expressed from its own leader peptide. 
[0025] Another useful replacement leader peptide is MKKYLFSAA. This can direct secretion into culture medium, and 

50 is extremely short and active. The use of this leader peptide is not restricted to the expression of Neisserial proteins - 
it may be used to direct the expression of any protein (particularly bacterial proteins). 

Leader-peptide deletion 

55 [0026] In a third approach to heterologous expression, the native leader peptide of a protein of the invention is deleted. 
In addition, it is preferred that no fusion partner is used. 

[0027] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
(a) the protein's leader peptide is deleted and, optionally, (b) no fusion partner is used. 
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[0028] The method will typically involve the steps of: obtaining nucleic acid encoding a protein of the invention; ma- 
nipulating said nucleic acid to remove nucleotides that encode the protein's leader peptide. The resulting nucleic acid 
may be inserted into an expression vector, or may already be part of an expression vector. The first amino acid of the 
expressed protein will be that of the mature native protein. 
5 [0029] This method can increase the levels of expression. For protein 919, for example, expression levels in E.coli 
are much higher when the leader peptide is deleted. Increased expression may be due to altered localisation in the 
absence of the leader peptide. 

[0030] The method is preferably used for the expression of 91 9, ORF46, 961 , 050-1 , 760 and 287. 

10 Domain-based expression 

[0031] In a fourth approach to heterologous expression, the protein is expressed as domains. This may be used in 
association with fusion systems (e.g. GST or His-tag fusions). 

[0032] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
15 (a) at least one domain in the protein is deleted and, optionally, (b) no fusion partner is used. 

[0033] The method will typically involve the steps of: obtaining nucleic acid encoding a protein of the invention; ma- 
nipulating said nucleic acid to remove at least one domain from within the protein. The resulting nucleic acid may be 
inserted into an expression vector, or may already be part of an expression vector. Where no fusion partners are used, 
the first amino acid of the expressed protein will be that of a domain of the protein. 
20 [0034] A protein is typically divided into notional domains by aligning it with known sequences in databases and then 
determining regions of the protein which show different alignment patterns from each other. 

[0035] The method is preferably used for the expression of protein 287. This protein can be notionally split into three 
domains, referred to as A B & C (see Figure 5). Domain B aligns strongly with IgA proteases, domain C aligns strongly 
with transferrin-binding proteins, and domain A shows no strong alignment with database sequences. An alignment of 
25 polymorphic forms of 287 is disclosed in WO00/66741 . 

[0036] Once a protein has been divided into domains, these can be (a) expressed singly (b) deleted from with the 
protein e.g. protein ABCD — > ABD, ACD, BCD etc. or (c) rearranged e.g. protein ABC — > ACB, CAB etc. These three 
strategies can be combined with fusion partners is desired. 

[0037] ORF46 has also been notionally split into two domains - a first domain (amino acids 1-433) which is well- 
so conserved between species and serogroups, and a second domain (amino acids 433-608) which is not well-conserved. 
The second domain is preferably deleted. An alignment of polymorphic forms of ORF46 is disclosed in WO00/66741 . 
[0038] Protein 564 has also been split into domains (Figure 8), as have protein 961 (Figure 1 2) and protein 502 (amino 
acids 28-1 67 of the MC58 protein). 

35 Hybrid proteins 

[0039] In a fifth approach to heterologous expression, two or more (e.g. 3, 4, 5, 6 or more) proteins of the invention 
are expressed as a single hybrid protein. It is preferred that no non-Neisserial fusion partner (e.g. GST or poly-His) is used. 
[0040] This offers two advantages. Firstly, a protein that may be unstable or poorly expressed on its own can be 
40 assisted by adding a suitable hybrid partner that overcomes the problem. Secondly, commercial manufacture is simplified 
- only one expression and purification need be employed in order to produce two separately-useful proteins. 
[0041] Thus the invention provides a method for the simultaneous heterologous expression of two or more proteins 
of the invention, in which said two or more proteins of the invention are fused (i.e. they are translated as a single 
polypeptide chain). 

45 [0042] The method will typically involve the steps of: obtaining a first nucleic acid encoding a first protein of the 
invention; obtaining a second nucleic acid encoding a second protein of the invention; ligating the first and second nucleic 
acids. The resulting nucleic acid may be inserted into an expression vector, or may already be part of an expression vector. 
[0043] Preferably, the constituent proteins in a hybrid protein according to the invention will be from the same strain. 
[0044] The fused proteins in the hybrid may be joined directly, or may be joined via a linker peptide e.g. via a poly- 

50 glycine linker (i.e. G n where n = 3, 4, 5, 6, 7, 8,9,10 or more) or via a short peptide sequence which facilitates cloning. 
It is evidently preferred not to join a AG protein to the C-terminus of a poly-glycine linker. 

[0045] The fused proteins may lack native leader peptides or may include the leader peptide sequence of the N- 
terminal fusion partner. 

[0046] The method is well suited to the expression of proteins orfl , orf4, orf25, orf40, Orf46/46.1 , orf83, 233, 287, 
55 2 92L, 564, 687, 741 , 907, 91 9, 953, 961 and 983. 

[0047] The 42 hybrids indicated by 'X' in the following table of form NH 2 -A-B-COOH are preferred: 
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10 





ORF46.1 


287 


741 


919 


953 


961 


983 


ORF46.1 
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X 


X 
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X 


287 


X 




X 
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741 
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X 
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X 


919 


X 


X 


X 
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X 


953 
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X 


X 




X 


X 


961 


X 


X 


X 


X 


X 




X 


983 


X 


X 


X 


X 


X 


X 





15 [0048] Preferred proteins to be expressed as hybrids are thus ORF46.1 , 287, 741 , 91 9, 953, 961 and 983. These may 
be used in their essentially full-length form, or poly-glycine deletions (AG) forms may be used (e.g. AG-287, AGTbp2, 
AG741 , AG983 etc.), or truncated forms may be used (e.g. A1-287, A2-287 etc.), or domain-deleted versions may be 
used (e.g. 287B, 287C, 287BC, ORF46 1 _ 4 3 3 , ORF46 43 3_ 608 , ORF46, 961c etc.). 

[0049] Particularly preferred are: (a) a hybrid protein comprising 91 9 and 287; (b) a hybrid protein comprising 953 and 
20 287; (c) a hybrid protein comprising 287 and ORF46.1 ; (d) a hybrid protein comprising ORF1 and ORF46.1 ; (e) a hybrid 
protein comprising 91 9 and ORF46.1 ; (f) a hybrid protein comprising ORF46.1 and 91 9; (g) a hybrid protein comprising 
ORF46.1, 287 and 919; (h) a hybrid protein comprising 919 and 519; and (i) a hybrid protein comprising ORF97 and 
225. Further embodiments are shown in Figure 14. 

[0050] Where 287 is used, it is preferably at the C-terminal end of a hybrid; if it is to be used at the N-terminus, if is 
25 preferred to use a AG form of 287 is used (e.g. as the N-terminus of a hybrid with ORF46.1 , 91 9, 953 or 961 ). 
[0051] Where 287 is used, this is preferably from strain 2996 or from strain 394/98. 
[0052] Where 961 is used, this is preferably at the N-terminus. Domain forms of 961 may be used. 
[0053] Alignments of polymorphic forms of ORF46, 287, 919 and 953 are disclosed in WO00/66741 . Any of these 
polymorphs can be used according to the present invention. 

30 

Temperature 

[0054] In a sixth approach to heterologous expression, proteins of the invention are expressed at a low temperature. 
[0055] Expressed Neisserial proteins (e.g. 91 9) may be toxic to E.coli, which can be avoided by expressing the toxic 
35 protein at a temperature at which its toxic activity is not manifested. 

[0056] Thus the present invention provides a method for the heterologous expression of a protein of the invention, in 
which expression of a protein of the invention is carried out at a temperature at which a toxic activity of the protein is 
not manifested. 

[0057] A preferred temperature is around 30°C. This is particularly suited to the expression of 91 9. 

40 

Mutations 

[0058] As discussed above, expressed Neisserial proteins may be toxic to E coli. This toxicity can be avoided by 
mutating the protein to reduce or eliminate the toxic activity. In particular, mutations to reduce or eliminate toxic enzymatic 
45 activity can be used, preferably using site-directed mutagenesis. 

[0059] In a seventh approach to heterologous expression, therefore, an expressed protein is mutated to reduce or 
eliminate toxic activity. 

[0060] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
protein is mutated to reduce or eliminate toxic activity. 
so [0061] The method is preferably used for the expression of protein 907, 91 9 or 922. A preferred mutation in 907 is at 
Glu-117 (e.g. Glu->Gly); preferred mutations in 919 are at Glu-255 (e.g. Glu-»Gly) and/or Glu-323 (e.g. Glu->Gly); 
preferred mutations in 922 are at Glu-164 (e.g. Glu->Gly), Ser-213 (e.g. Ser^Gly) and/or Asn-348 (e.g. Asn^Gly). 

Alternative vectors 

55 

[0062] In a eighth approach to heterologous expression, an alternative vector used to express the protein. This may 

be to improve expression yields, for instance, or to utilise plasmids that are already approved for GMP use. 

[0063] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
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an alternative vector is used. The alternative vector is preferably pSM214, with no fusion partners. Leader peptides may 
or may not be included. 

[0064] This approach is particularly useful for protein 953. Expression and localisation of 953 with its native leader 
peptide expressed from pSM214 is much better than from the pET vector. 
5 [0065] pSM214 may also be used with: AG287, A2-287, A3-287, A4-287, Orf46.1 , 961 L, 961 , 96 1 (MC58), 96 1 c, 
96 1 c-L, 919, 953 and AG287-Orf46.1 . 

[0066] Another suitable vector is pET-24b (Novagen; uses kanamycin resistance), again using no fusion partners. 
pET-24b is preferred for use with: AG287K, A2-287K, A3-287K, A4-287K, 

[0067] Orf46.1-K, Orf46A-K, 961 -K (MC58), 961 a-K, 961 b-K, 961 c-K, 961C-L-K, 961 d-K, AG287-919-K, AG287- 
10 Orf46.1-Kand AG287-961-K. 

Multimeric form 

[0068] In a ninth approach to heterologous expression, a protein is expressed or purified such that it adopts a particular 
15 multimeric form. 

[0069] This approach is particularly suited to protein 953. Purification of one particular multimeric form of 953 (the 

monomeric form) gives a protein with greater bactericidal activity than other forms (the dimeric form). 

[0070] Proteins 287 and 91 9 may be purified in dimeric forms. 

[0071 ] Protein 961 may be purified in a 1 80kDa oligomeric form (e.g. a tetramer). 

20 

Lipidation 

[0072] In a tenth approach to heterologous expression, a protein is expressed as a lipidated protein. 
[0073] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
25 the protein is expressed as a lipidated protein. 

[0074] This is particularly useful for the expression of 919, 287, ORF4, 406, 576-1 , and ORF25. Polymorphic forms 
of 919, 287 and ORF4 are disclosed in WO00/66741 . 

[0075] The method will typically involve the use of an appropriate leader peptide without using an N-terminal fusion 
partner. 

30 

C-terminal deletions 

[0076] In an eleventh approach to heterologous expression, the C-terminus of a protein of the invention is mutated. 
In addition, it is preferred that no fusion partner is used. 
35 [0077] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
(a) the protein's C-terminus region is mutated and, optionally, (b) no fusion partner is used. 

[0078] The method will typically involve the steps of: obtaining nucleic acid encoding a protein of the invention; ma- 
nipulating said nucleic acid to mutate nucleotides that encode the protein's C-terminus portion. The resulting nucleic 
acid may be inserted into an expression vector, or may already be part of an expression vector. The first amino acid of 
40 the expressed protein will be that of the mature native protein. 

[0079] The mutation may be a substitution, insertion or, preferably, a deletion. 

[0080] This method can increase the levels of expression, particularly for proteins 730, ORF29 and ORF46. For protein 
730, a C-terminus region of around 65 to around 214 amino acids may be deleted; for ORF46, the C-terminus region 
of around 175 amino acids may be deleted; for ORF29, the C-terminus may be deleted to leave around 230-370 N- 
45 terminal amino acids. 

Leader peptide mutation 

[0081] In a twelfth approach to heterologous expression, the leader peptide of the protein is mutated. This is particularly 
50 useful for the expression of protein 91 9. 

[0082] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
the protein's leader peptide is mutated. 

[0083] The method will typically involve the steps of: obtaining nucleic acid encoding a protein of the invention; and 
manipulating said nucleic acid to mutate nucleotides within the leader peptide. The resulting nucleic acid may be inserted 
55 into an expression vector, or may already be part of an expression vector. 
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Poly-glycine deletion 

[0084] In a thirteenth approach to heterologous expression, poly-glycine stretches in wild-type sequences are mutated. 
This enhances protein expression. 
5 [0085] The poly-glycine stretch has the sequence (Gly) n , where n>4 (e.g. 5, 6, 7, 8, 9 or more). This stretch is mutated 
to disrupt or remove the (Gly) n . This may be by deletion (e.g. CGGGGS— > CGGGS, CGGS, CGS or CS), by substitution 
(e.g. CGGGGS— > CGXGGS, CGXXGS, CGXGXS etc.), and/or by insertion (e.g. CGGGGS— > CGGXGGS, CGXGGGS, 
etc.). 

[0086] This approach is not restricted to Neisserial proteins - it may be used for any protein (particularly bacterial 
10 proteins) to enhance heterologous expression. For Neisserial proteins, however, it is particularly suitable for expressing 
287, 741 , 983 and Tbp2. An alignment of polymorphic forms of 287 is disclosed in WO00/66741 . 
[0087] Thus the invention provides a method for the heterologous expression of a protein of the invention, in which 
(a) a poly-glycine stretch within the protein is mutated. 

[0088] The method will typically involve the steps of: obtaining nucleic acid encoding a protein of the invention; and 
15 manipulating said nucleic acid to mutate nucleotides that encode a poly-glycine stretch within the protein sequence. 
The resulting nucleic acid may be inserted into an expression vector, or may already be part of an expression vector. 
[0089] Conversely, the opposite approach (i.e. introduction of poly-glycine stretches) can be used to suppress or 
diminish expression of a given heterologous protein. 

20 Heterologous host 

[0090] Whilst expression of the proteins of the invention may take place in the native host (i.e. the organism in which 
the protein is expressed in nature), the present invention utilises a heterologous host. The heterologous host may be 
prokaryotic or eukaryotic. It is preferably E.coii, but other suitable hosts include Bacillus subtil is, Vibrio cholerae, Sal- 
25 moneila typhi, Salmonenna typhimurium, Neisseria meningitidis, Neisseria gonorrhoeae, Neisseria lactamica, Neisseria 
cinerea, Mycobateria (e.g. M. tuberculosis), yeast etc. 

Vectors etc. 

30 [0091] As well as the methods described above, the invention provides (a) nucleic acid and vectors useful in these 
methods (b) host cells containing said vectors (c) proteins expressed or expressable by the methods (d) compositions 
comprising these proteins, which may be suitable as vaccines, for instance, or as diagnostic reagents, or as immunogenic 
compositions (e) these compositions for use as medicaments (e.g. as vaccines) or as diagnostic reagents (f) the use of 
these compositions in the manufacture of (1 ) a medicament for treating or preventing infection due to Neisserial bacteria 

35 (2) a diagnostic reagent for detecting the presence of Neisserial bacteria or of antibodies raised against Neisserial 
bacteria, and/or (3) a reagent which can raise antibodies against Neisserial bacteria and (g) a method of treating a 
patient, comprising administering to the patient a therapeutically effective amount of these compositions. 

Sequences 

40 

[0092] The invention also provides a protein or a nucleic acid having any of the sequences set out in the following 
examples. It also provides proteins and nucleic acid having sequence identity to these. As described above, the degree 
of 'sequence identity' is preferably greater than 50% (eg. 60%, 70%, 80%, 90%, 95%, 99% or more). 
[0093] Furthermore, the invention provides nucleic acid which can hybridise to the nucleic acid disclosed in the ex- 
45 amples, preferably under "high stringency" conditions (eg. 65°C in a O.lxSSC, 0.5% SDS solution). 
[0094] The invention also provides nucleic acid encoding proteins according to the invention. 
[0095] It should also be appreciated that the invention provides nucleic acid comprising sequences complementary 
to those described above (eg. for antisense or probing purposes). 

[0096] Nucleic acid according to the invention can, of course, be prepared in many ways (eg. by chemical synthesis, 
50 from genomic or cDNA libraries, from the organism itself etc. ) and can take various forms (eg. single stranded, double 
stranded, vectors, probes etc. ). 

[0097] In addition, the term "nucleic acid" includes DNA and RNA, and also their analogues, such as those containing 
modified backbones, and also peptide nucleic acids (PNA) etc. 

55 BRIEF DESCRIPTION OF DRAWINGS 

[0098] 
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Figures 1 and 2 show constructs used to express proteins using heterologous leader peptides. 
Figure 3 shows expression data for ORF1 , and Figure 4 shows similar data for protein 961 . 
5 Figure 5 shows domains of protein 287, and Figures 6 & 7 show deletions within domain A. 

Figure 8 shows domains of protein 564. 

Figure 9 shows the PhoC reporter gene driven by the 91 9 leader peptide, and Figure 1 0 shows the results obtained 
10 using mutants of the leader peptide. 

Figure 1 1 shows insertion mutants of protein 730 (A: 730-C1 ; B: 730-C2). 

Figure 12 shows domains of protein 961 . 

15 

Figure 13 shows SDS-PAGE of AG proteins. Dots show the main recombinant product. 
Figure 14 shows 26 hybrid proteins according to the invention. 
20 MODES FOR CARRYING OUT THE INVENTION 
Example 1-919 and its leader peptide 

[0099] Protein 919 from N. meningitidis (serogroup B, strain 2996) has the following sequence: 

25 

1 MKKYLFRAAL YGIAAAILAA CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 
51 GTTVGGGGAV YTVVPHLSLP HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 
101 CAQAFQT PVH SFQAKQFFER YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR 
151 RTAQARFPIY GIPDDFISVP LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 
30 201 HTADLSRFPI TARTTAIKGR FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

251 EDPVELFFMH IQGSGRLKTP SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 
301 KLGQTSMQGI KAYMRQNPQR LAEVLGQNPS YIFFRELAGS SNDGPVGALG 
351 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 
4 01 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 

35 

[0100] The leader peptide is underlined. 

[0101] The sequences of 919 from other strains can be found in Figures 7 and 18 of WO00/66741 . 
[0102] Example 2 of WO99/57280 discloses the expression of protein 919 as a His-fusion in E.coii. 
[0103] The protein is a good surface-exposed immunogen. 
40 [0104] Three alternative expression strategies were used for 91 9: 

1) 919 without its leader peptide (and without the mature N-terminal cysteine) and without any fusion partner 
(>919untagged>) ; 

45 

1 QSKSIQTFP QPDTSVINGP DRPVGIPDPA GTTVGGGGAV YTVVPHLSLP 

50 HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV CAQAFQT PVH SFQAKQFFER 

100 YFTPWQVAGN GSLAGTVTGY YEPVLKGDDR RTAQARFPIY GIPDDFISVP 

150 LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT HTADLSRFPI TARTTAIKGR 

200 FEGSRFLPYH TRNQINGGAL DGKAPILGYA EDPVELFFMH IQGSGRLKTP 

50 250 SGKYIRIGYA DKNEHPYVSI GRYMADKGYL KLGQTSMQGI KAYMRQNPQR 

300 LAEVLGQNPS YIFFRELAGS SNDGPVGALG TPLMGEYAGA VDRHYITLGA 

350 PLFVATAHPV TRKALNRLIM AQDTGSAIKG AVRVDYFWGY GDEAGELAGK 

4 00 QKTTGYVWQL LPNGMKPEYR P* 



55 The leader peptide and cysteine were omitted by designing the 5'-end amplification primer downstream from the 

predicted leader sequence. 

2) 91 9 with its own leader peptide but without any fusion partner ('91 9L'); and 

3) 919 with the leader peptide (MKTFFKTLSAAALALILAA) from ORF4 ('91 9LOrf4'). 
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1 MKTFFKTLS AAALALILAA 

50 GTTVGGGGAV YTWPHLSLP 

100 CAQAFQTPVH SFQAKQFFER 

150 RTAQARFPIY GIPDDFISVP 

200 HTADLSRFPI TARTTAIKGR 

250 EDPVELFFMH IQGSGRLKTP 

300 KLGQTSMQGI KSYMRQNPQR 



CQSKSIQTFP QPDTSVINGP DRPVGIPDPA 

HWAAQDFAKS LQSFRLGCAN LKNRQGWQDV 

YFT PWQVAGN GSLAGTVTGY YEPVLKGDDR 

LPAGLRSGKA LVRIRQTGKN SGTIDNTGGT 

FEGSRFLPYH TRNQINGGAL DGKAPILGYA 

SGKYIRIGYA DKNEHPYVSI GRYMADKGYL 

LAEVLGQNPS YIFFRELAGS SNDGPVGALG 
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350 TPLMGEYAGA VDRHYITLGA PLFVATAHPV TRKALNRLIM AQDTGSAIKG 
400 AVRVDYFWGY GDEAGELAGK QKTTGYVWQL LPNGMKPEYR P* 



15 To make this construct, the entire sequence encoding the ORF4 leader peptide was included in the 5'-primer as a 

tail (primer 919Lorf4 For). A Nhe\ restriction site was generated by a double nucleotide change in the sequence 
coding for the ORF4 leader (no amino acid changes), to allow different genes to be fused to the ORF4 leader peptide 
sequence. A stop codon was included in all the 3'-end primer sequences. 

20 [0105] All three forms of the protein were expressed and could be purified. 

[0106] The '91 9L' and '91 9LOrf4' expression products were both lipidated, as shown by the incorporation of [ 3 H]-palmi- 
tate label. 919 unta 99 ed did not incorporate the 3 H label and was located intracellular^. 

[0107] 919LOrf4 could be purified more easily than 91 9L. It was purified and used to immunise mice. The resulting 
sera gave excellent results in FACS and ELISA tests, and also in the bactericidal assay. The lipoprotein was shown to 
25 be localised in the outer membrane. 

[0108] giguntagged g ave excellent ELISA titres and high serum bactericidal activity. FACS confirmed its cell surface 
location. 



Example 2 — 919 and expression temperature 

30 

[0109] Growth of E.coli expressing the 919LOrf4 protein at37°C resulted in lysis of the bacteria. In order to overcome 
this problem, the recombinant bacteria were grown at 30°C. Lysis was prevented without preventing expression. 



Example 3 - mutation of 907, 919 and 922 

35 

[0110] It was hypothesised that proteins 907, 919 and 922 are murein hydrolases, and more particularly lytic transg- 
lycosylases. Murein hydrolases are located on the outer membrane and participate in the degradation of peptidoglycan. 
[0111] The purified proteins 919 unta 9g ed , 919Lorf4, 919-His (i.e. with a C-terminus His-tag) and 922-His were thus 
tested for murein hydrolase activity [Ursinus & Holtje (1994) J.Bact. 176:338-343]. Two different assays were used, one 
40 determining the degradation of insoluble murein sacculus into soluble muropeptides and the other measuring breakdown 
of poly(MurNAc-GlcNAc) n>30 glycan strands. 

[0112] The first assay uses murein sacculi radiolabeled with meso-2,6-diamino-3,4,5-[ 3 H]pimelic acid as substrate. 
Enzyme (3-1 0 |xg total) was incubated for 45 minutes at 37°C in a total volume of 1 OOjjlI comprising 1 0mM Tris-maleate 
(pH 5.5), 10mM MgCI 2 , 0.2% v/v Triton X-100 and [ 3 H]A 2 pm labelled murein sacculi (about 10000cpm). The assay 

45 mixture was placed on ice for 15 minutes with 100 jxl of 1% w/v N-acetyl-N,N,N-trimethylammonium for 15 minutes and 
precipitated material pelleted by centrifugation at 10000g for 15 minutes. The radioactivity in the supernatant was 
measured by liquid scintillation counting. E.coli soluble lytic transglycosylase Slt70 was used as a positive control for 
the assay; the negative control comprised the above assay solution without enzyme. 
[0113] All proteins except 919-His gave positive results in the first assay. 

50 [01 14] The second assay monitors the hydrolysis of poly(MurNAc-GlcNAc)glycan strands. Purified strands, poly(Mur- 
NAc-GlcNAc) n>30 labelled with N-acetyl-D-1 -[ 3 H]glucosamine were incubated with 3^g of 919L in 10 mM Tris-maleate 
(pH 5.5), 1 0 mM MgCI 2 and 0.2% v/v Triton X-1 00 for 30 min at 37°C. The reaction was stopped by boiling for 5 minutes 
and the pH of the sample adjusted to about 3.5 by addition of 10^x1 of 20% v/v phosphoric acid. Substrate and product 
were separated by reversed phase HPLC on a Nucleosil 300 C 18 column as described by Harz et. al. [Anal. Biochem. 

55 (1990) 190:120-128]. The E.coli lytic transglycosylase Mlt A was used as a positive control in the assay. The negative 
control was performed in the absence of enzyme. 

[0115] By this assay, the ability of 919LOrf4 to hydrolyse isolated glycan strands was demonstrated when anhydro- 
disaccharide subunits were separated from the oligosaccharide by HPLC. 
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[0116] Protein 919Lorf4 was chosen for kinetic analyses. The activity of 919Lorf4 was enhanced 3.7-fold by the 
addition of 0.2% v/v Triton X-100 in the assay buffer. The presence of Triton X-100 had no effect on the activity of 
giguntagged jh e effect of pH on enzyme activity was determined in Tris-Maleate buffer over a range of 5.0 to 8.0. The 
optimal pH for the reaction was determined to be 5.5. Over the temperature range 1 8°C to 42°C, maximum activity was 
5 observed at 37°C. The effect of various ions on murein hydrolase activity was determined by performing the reaction in 
the presence of a variety of ions at a final concentration of 1 0mM. Maximum activity was found with Mg 2+ , which stimulated 
activity 2.1 -fold. Mn 2+ and Ca 2+ also stimulated enzyme activity to a similar extent while the addition Ni 2+ and EDTA 
had no significant effect. In contrast, both Fe 2+ and Zn 2+ significantly inhibited enzyme activity. 

[0117] The structures of the reaction products resulting from the digestion of unlabelled E.coli murein sacculus were 
10 analysed by reversed-phase HPLC as described by Glauner [Anal. Biochem. (1988) 172:451-464]. Murein sacculi 
digested with the muramidase Cellosyl were used to calibrate and standardise the Hypersil ODS column. The major 
reaction products were 1 ,6 anhydrodisaccharide tetra and tri peptides, demonstrating the formation of 1 ,6 anhydromu- 
raminic acid intramolecular bond. 

[0118] These results demonstrate experimentally that 919 is a murein hydrolase and in particular a member of the 
15 lytic transglycosylase family of enzymes. Furthermore the ability of 922-His to hydrolyse murein sacculi suggests this 
protein is also a lytic transglycosylase. 

[0119] This activity may help to explain the toxic effects of 919 when expressed in E.coli. 

[0120] In order to eliminate the enzymatic activity, rational mutagenesis was used. 907, 91 9 and 922 show fairly low 
homology to three membrane-bound lipidated murein lytic transglycosylases from E.coli: 

20 

919 (441 aa) is 27.3% identical over 440aa overlap to E.coli MLTA (P46885); 
922 (369aa) is 38.7% identical over 310aa overlap to E.coli MLTB (P41052); and 
907-2 (207aa) is 26.8% identical over 149aa overlap to E.coli MLTC (P52066). 

25 907-2 also shares homology with E.coli MLTD (P23931) and Slt70 (P03810), a soluble lytic transglycosylase that is 
located in the periplasmic space. No significant sequence homology can be detected among 919, 922 and 907-2, and 
the same is true among the corresponding MLTA, MLTB and MLTC proteins. 

[0121] Crystal structures are available for Slt70 [1QTEA; 1QTEB; Thunnissen et al. (1995) Biochemistry 34: 
12729-12737] and for Slt35 [1LTM; 1QUS; 1QUT; van Asselt et al. (1999) Structure Fold Des 7:1167-80] which is a 
30 soluble form of the 40kDa MLTB. 

[0122] The catalytic residue (a glutamic acid) has been identified for both Slt70 and MLTB. 

[0123] In the case of Slt70, mutagenesis studies have demonstrated that even a conservative substitution of the 
catalytic Glu505 with a glutamine (Gin) causes the complete loss of enzymatic activity. Although Slt35 has no obvious 
sequence similarity to Slt70, their catalytic domains shows a surprising similarity. The corresponding catalytic residue 
35 in MLTB is Glu162. 

[0124] Another residue which is believed to play an important role in the correct folding of the enzymatic cleft is a well- 
conserved glycine (Gly) downstream of the glutamic acid. Recently, Terrak et al. [Mol. Microbiol. (1 999) 34:350-64] have 
suggested the presence of another important residue which is an aromatic amino acid located around 70-75 residues 
downstream of the catalytic glutamic acid. 
40 [0125] Sequence alignment of Slt70 with 907-2 and of MLTB with 922 were performed in order to identify the corre- 
sponding catalytic residues in the MenB antigens. 

[0126] The two alignments in the region of the catalytic domain are reported below: 



907-2/Slt70: 

90 100 110 T120 130 140 

907-2 .pep ERRRLLVN IQYE SSRAG — LDTQIVLGLIEVESAFRQYAISGVGARGLMQVMPFWKNYIG 

II I I : : : I : :::::: I I I : : I III 1111 = 11 : : 
slty_ecoli ERFPLAYNDLFKRYTSGKEIPQSYAMAIARQESAWNPKVKSPVGASGLMQIMPGTATHTV 
480 490 500 A 510 520 530 

GLU505 



55 
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922/MLTB 

150 160 ▼ 170 180 190 200 

922 . pep VAQKYGVPAELIVAVIGIETNYGKNTGSFRVADALATLGFDYPRRAGFFQKELVELLKLA 
5 : I I I I I I : I I : : I I : I I : I : I : I : I I I I I I : I : I I I I I : I : I : I : I 

mltb_ecoli AWQVYGVPPEIIVGI IGVETRWGRVMGKTRILDALATLSFNYPRRAEYFSGELETFLLMA 
150 160 A 170 180 190 200 

GLU162 

210 220 230 240 250 260 

10 922 .pep KEEGGDVFAFKGSYAGAMGMPQFMPSSYRKWAVDYDGDGHRDIWGNVGDVAASVANYMKQ 

: : I I : : I I I : I I I I I : I I I I I I I : : : I I I : : I I I I : : I I I : : I I I I I : I 
mltb_ecoli RDEQDDPLNLKGSFAGAMGYGQFMPSSYKQYAVDFSGDGHINLWDPV-DAIGSVANYFKA 
210 220 230 240 250 260 

15 [0127] From these alignments, it results that the corresponding catalytic glutamate in 907-2 is Glu117, whereas in 
922 is Glu164. Both antigens also share downstream glycines that could have a structural role in the folding of the 
enzymatic cleft (in bold), and 922 has a conserved aromatic residue around 70aa downstream (in bold). 
[0128] In the case of protein 919, no 3D structure is available for its E.coli homologue MLTA, and nothing is known 
about a possible catalytic residue. Nevertheless, three amino acids in 91 9 are predicted as catalytic residues by alignment 

20 with MLTA: 



919/MLTA 

25 



240 250 T 260 □ □ 270 □ 280 290 

919. pep ALDGKAPILGYAEDPVELFFMHIQGSGRLKTPSGKYIRI-GYADKNEHPYVSIGRYMADK 
I I : I | | : | : : : : : | : I : I I I I : : I : : : : I I I I I I I I I : : I : 
mlta_ecoli . p ALSDKY-ILAYSNSLMDNFIMDVQGSGYIDFGDGSPLNFFSYAGKNGHAYRSIGKVLIDR 

170 180 190 200 210 



30 



300 310 320 T 330D m 340 0350 0 

35 919 .pep GYLKLGQTSMQGIKSYMRQNPQ-RLAEVLGQNPSYIFFRELAGSSNDGPV-GALGTPLMG 

1:1 : I I I : I ::::::: I : I I I I I :: I I : : : I I I I : : I I : I 

mlta_ecoli.p GEVKKEDMSMQAIRHWGETHSEAEVRELLEQNPSFVFFKPQSFA PVKGASAVPLVG 

220 230 240 250 260 270 

360 ▼ o 380 390 400 00410 

40 919. pep EYAGAVDRHYITLGAPLFVATAHPVTRKALN RLIMAQDTGSAIKGAVRVDYFWGY 

: : I I I I I : I : : : : : I I I : : I I : I : I I I I : I : I 

mlta_ecoli.p RASVASDRSIIPPGTTLLAEVPLLDNNGKFNGQYELRLMVALDVGGAIKGQ-HFDIYQGI 
280 290 300 310 320 330 

420 o 

45 919. pep GDEAGELAGKQKTTGYVWQLLP 

I I I I : I I : I I I I 
mlta_ecoli .p GPEAGHRAGWYNHYGRVWVLKT 
340 350 

50 [0129] The three possible catalytic residues are shown by the symbol ▼: 

1) Glu255 (Asp in MLTA), followed by three conserved glycines (Gly263, Gly265 and Gly272) and three conserved 
aromatic residues located approximately 75-77 residues downstream. These downstream residues are shown by o. 

55 2) Glu323 (conserved in MLTA), followed by 2 conserved glycines (Gly347 and Gly355) and two conserved aromatic 

residues located 84-85 residues downstream (Tyr406 or Phe407). These downstream residues are shown by O. 

3) Asp362 (instead of the expected Glu), followed by one glycine (Gly 369) and a conserved aromatic residue 
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(Trp428). These downstream residues are shown by O. 
[0130] Alignments of polymorphic forms of 91 9 are disclosed in WO00/66741 . 

[0131] Based on the prediction of catalytic residues, three mutants of the 919 and one mutant of 907, containing each 
a single amino acid substitution, have been generated. The glutamic acids in position 255 and 323 and the aspartic 
acids in position 362 of the 919 protein and the glutamic acid in position 1 17 of the 907 protein, were replaced with 
glycine residues using PCR-based SDM. To do this, internal primers containing a codon change from Glu or Asp to Gly 
were designed: 



Primers 


Sequences 


Codon change 


919-E255 for 
919-E255 rev 


CGAAGACCCCGTCGgtCTTTTTTTTATG 
GTGCATAAAAAAAAGacCGACGGGGTCT 


GAA -> Ggt 


919-E323 for 
919-E323 rev 


AACGCCTCGCCGgtGTTTTGGGTCA 
TTTGACCCAAAACacCGGCGAGGCG 


GAA -> Ggt 


919-D362 for 
919-D362 rev 


TGCCGGCGCAGTCGgtCGGCACTACA 
TAATGTAGTGCCGacCGACTGCGCCG 


GAC ^ Ggt 


907-E117for 
907-E117 rev 


TGATTGAGGTGGgtAGCGCGTTCCG 
GGCGGAACGCGCTacCCACCTCAAT 


GAA Ggt 


Underlined nucleotides code for glycine; the mutated nucleotides are in lowercase. 



[01 32] To generate the 91 9-E255, 91 9-E323 and 91 9-E362 mutants, PCR was performed using 20ng of the pET 91 9- 
LOrf4 DNA as template, and the following primer pairs: 

1) Orf4L for/919-E255 rev 

2) 919-E255 for/919L rev 

3) Orf4L for/919-E323 rev 

4) 919-E323 for/919L rev 

5) Orf4L for/919-D362 rev 

6) 919-D362 for/ 91 9L rev 

The second round of PCR was performed using the product of PCR 1-2, 3-4 or 5-6 as template, and as forward 
and reverse primers the "Orf4L for" and "91 9L rev" respectively. 

For the mutant 907-E117, PCR have been performed using 200ng of chromosomal DNA of the 2996 strain as 
template and the following primer pairs: 

7) 907L for/907-E117 rev 

8) 907-E 117for/907L rev 

[0133] The second round of PCR was performed using the products of PCR 7 and 8 as templates and the oligos "907L 
for" and "907L rev" as primers. 

[01 34] The PCR fragments containing each mutation were processed following the standard procedure, digested with 
A/del and Xho\ restriction enzymes and cloned into pET-21b+ vector. The presence of each mutation was confirmed by 
sequence analysis. 

[0135] Mutation of Glu1 17 to Gly in 907 is carried out similarly, as is mutation of residues Glu164, Ser213 and Asn348 
in 922. 

[0136] The E255G mutant of 919 shows a 50% reduction in activity; the E323G mutant shows a 70% reduction in 
activity; the E362G mutant shows no reduction in activity. 

Example 4 - multimeric form 

[0137] 287-GST, 919 unta 9g ed and 953-His were subjected to gel filtration for analysis of quaternary structure or pre- 
parative purposes. The molecular weight of the native proteins was estimated using either FPLC Superose 12 (H/R 
10/30) or Superdex 75 gel filtration columns (Pharmacia). The buffers used for chromatography for 287, 919 and 953 
were 50 mM Tris-HCI (pH 8.0), 20 mM Bicine (pH 8.5) and 50 mM Bicine (pH 8.0), respectively. 
[0138] Additionally each buffer contained 150-200 mM NaCI and 10% v/v glycerol. Proteins were dialysed against the 
appropriate buffer and applied in a volume of 200^1. Gel filtration was performed with a flow rate of 0.5 - 2.0 ml/min and 
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the eluate monitored at 280nm. Fractions were collected and analysed by SDS-PAGE. Blue dextran 2000 and the 
molecular weight standards ribonuclease A, chymotrypsin A ovalbumin, albumin (Pharmacia) were used to calibrate the 
column. The molecular weight of the sample was estimated from a calibration curve of K av vs. log M r of the standards. 
Before gel filtration, 287-GST was digested with thrombin to cleave the GST moiety. 
5 [0139] The estimated molecular weights for 287, 919 and 953-His were 73 kDa, 47 kDa and 43 kDa respectively. 
These results suggest 919 is monomeric while both 287 and 953 are principally dimeric in their nature. In the case of 
953-His, two peaks were observed during gel filtration. The major peak (80%) represented a dimeric conformation of 
953 while the minor peak (20%) had the expected size of a monomer. The monomeric form of 953 was found to have 
greater bactericidal activity than the dimer. 

10 

Example 5 - pSM214 and pET-24b vectors 

[0140] 953 protein with its native leader peptide and no fusion partners was expressed from the pET vector and also 
from pSM214 [Velati Bellini et al. (1991) J. Biotechnol. 18, 177-192]. 
15 [0141] The 953 sequence was cloned as a full-length gene into pSM214 using the E. coli MM294-1 strain as a host. 
To do this, the entire DNA sequence of the 953 gene (from ATG to the STOP codon) was amplified by PCR using the 
following primers: 

953L for/2 CCG GAATTC TTATGAAAAAAATCATCTTCGCCGC Eco Rl 
953L rev/2 GCCC AAGCTT TTATTGTTTGGCTGCCTCGATT Hind III 
20 which contain EcoRI and Hind\\\ restriction sites, respectively. The amplified fragment was digested with EcoRI and 
Hind\\\ and ligated with the pSM21 4 vector digested with the same two enzymes. The ligated plasmid was transformed 
into E.cofi MM 294-1 cells (by incubation in ice for 65 minutes at 37° C) and bacterial cells plated on LB agar containing 
20|xg/ml of chloramphenicol. 

[0142] Recombinant colonies were grown over-night at 37°C in 4 ml of LB broth containing 20 jxg/ml of chloramphenicol; 

25 bacterial cells were centrifuged and plasmid DNA extracted as and analysed by restriction with EcoRI and Hind\\\. To 
analyse the ability of the recombinant colonies to express the protein, they were inoculated in LB broth containing 
20fxg/ml of chloramphenicol and let to grown for 1 6 hours at 37°C. Bacterial cells were centrifuged and resuspended in 
PBS. Expression of the protein was analysed by SDS-PAGE and Coomassie Blue staining. 
[0143] Expression levels were unexpectedly high from the pSM214 plasmid. 

30 [0144] Oligos used to clone sequences into pSM-21 4 vectors were as follows: 



35 



40 



45 



50 



55 



AG287 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-TCGCCCGATGTTAAATCGGCGGA 


EcoRI 


Rev 




Hindlll 


A2 287 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-AGCCAAGATATGGCGGCAGT 


EcoRI 


Rev 


GCCC AAG CTT-TCAATCCTGCTCTTTTTTG CCG 


Hindlll 


A3 287 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-TCCGCCGAATCCGCAAATCA 


EcoRI 


Rev 


GCCC AAG CTT-TCAATCCTGCTCTTTTTTG CCG 


Hindlll 


A4 287 
(pSM-214) 


Fwd 


CCG G AATTCTTATG -G G AAG G G TTG ATTTG G CT AATG 


EcoRI 


Rev 


GCCC AAG CTT-TCAATCCTGCTCTTTTTTG CCG 


Hindlll 


Orf46.1 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-TCAGATTTGGCAAACGATTCTT 


EcoRI 


Rev 


GCCCAAGCTT-TTACGTATCATATTTCACGTGCTTC 


Hindlll 


AG287-Orf46.1 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-TCGCCCGATGTTAAATCGGCGGA 


EcoRI 


Rev 


GCCCAAGCTT-TTACGTATCATATTTCACGTGCTTC 


Hindlll 


919 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-CAAAGCAAGAGCATCCAAACCT 


EcoRI 


Rev 


GCCCAAGCTT-TTACGGGCGGTATTCGGGCT 


Hindlll 


961 L 
(pSM-214) 


Fwd 


CCGGAATTCATATG-AAACACTTTCCATCC 


EcoRI 


Rev 


GCCCAAGCTT-TTACCACTCGTAATTGAC 


Hindlll 


961 
(pSM-214) 


Fwd 


CCGGAATTCATATG-GCCACAAGCGACGAC 


EcoRI 


Rev 


GCCCAAGCTT-TTACCACTCGTAATTGAC 


Hindlll 
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(continued) 



5 



961c L 
pSM-214 


Fwd 


CCGGAATTCTTATG-AAACACTTTCCATCC 


EcoRI 


Rev 


GCCCAAGCTT-TCAACCCACGTTGTAAGGTTG 


Hindlll 


961c 
pSM-214 


Fwd 


CCGGAATTCTTATG-GCCACAAACGACGACG 


EcoRI 


Rev 


GCCCAAGCTT-TCAACCCACGTTGTAAGGTTG 


Hindlll 


953 
(pSM-214) 


Fwd 


CCGGAATTCTTATG-GCCACCTACAAAGTGGACGA 


EcoRI 


Rev 


GCCCAAGCTT-TTATTGTTTGGCTGCCTCGATT 


Hindlll 



These sequences were manipulated, cloned and expressed as described for 953L. 

[0145] For the pET-24 vector, sequences were cloned and the proteins expressed in pET-24 as described below for 
pET21. pET2 has the same sequence as pET-21, but with the kanamycin resistance cassette instead of ampicillin 
cassette. 

[0146] Oligonucleotides used to clone sequences into pET-24b vector were: 





AG 287 K 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC § 


Nhel 


20 




Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGCC * 


Xhol 




A2 287 K 


Fwd 


CGCGGATCCGCTAGC-CAAGATATGGCGGCAGT § 


Nhel 




A3 287 K 


Fwd 


CGCGGATCCGCTAGC-GCCGAATCCGCAAATCA § 


Nhel 


25 


A4 287 K 


Fwd 


CGCGCTAGC-GGAAGGGTTGATTTGGCTAATGG § 


Nhel 




Orf46.1 K 


Fwd 


GGGAATTCCATATG-GGCATTTCCCGCAAAATATC 


Ndel 






Rev 


CCCGCTCGAG-TTACGTATCATATTTCACGTGC 


Xhol 




Orf46A K 


Fwd 


GGGAATTCCATATG-GGCATTTCCCGCAAAATATC 


Ndel 


30 




Rev 


CCCGCTCGAG-TTATTCTATGCCTTGTGCGGCAT 


Xhol 




961 K 


Fwd 


CGCGGATCCCATATG-GCCACAAGCGACGACGA 


Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTACCACTCGTAATTGAC 


Xhol 


35 


961a K 


Fwd 


CGCGGATCCCATATG-GCCACAAACGACG 


Ndel 






Rev 


CCC G CTCG AG -TC ATTT AG C AAT ATT ATCTTTGTTC 


Xhol 




961b K 


Fwd 


CGCGGATCCCATATG-AAAGCAAACAGTGCCGAC 


Ndel 






Rev 


CCCGCTCGAG-TTACCACTCGTAATTGAC 


Xhol 


40 


961c K 


Fwd 


CGCGGATCCCATATG-GCCACAAACGACG 


Ndel 






Rev 


CCCGCTCGAG-TTAACCCACGTTGTAAGGT 


Xhol 




961cL K 


Fwd 


CGCGGATCCCATATG-ATGAAACACTTTCCATCC 


Ndel 


45 




Rev 


CCCGCTCGAG-TTAACCCACGTTGTAAGGT 


Xhol 




961d K 


Fwd 


CGCGGATCCCATATG-GCCACAAACGACG 


Ndel 






Rev 


CCCGCTCGAG-TCAGTCTGACACTGTTTTATCC 


Xhol 




AG 287- 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 


50 


919 K 


Rev 


CCCGCTCGAG-TTACGGGCGGTATTCGG 


Xhol 




AG 287- 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 




Orf46.1 K 


Rev 


CCCGCTCGAG-TTACGTATCATATTTCACGTGC 


Xhol 


55 


AG 287- 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 



14 
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(continued) 




961 K 


Rev 


CCCGCTCGAG-TTACCACTCGTAATTGAC 


Xhol 


* This primer was used as a Reverse primer for all the 287 forms. 

§ Forward primers used in combination with the AG278 K reverse primer. 



Example 6 - ORF1 and its leader peptide 

[0147] ORF1 from N. meningitidis (serogroup B, strain MC58) is predicted to be an outer membrane or secreted protein. 
It has the following sequence: 

1 MKTTDKRTTE THRKAPKTGR IRFSPAYLAI CLSFGILPQA WAGHTYFGIN 

51 YQYYRDFAEN KGKFAVGAKD IEVYNKKGEL VGKSMTKAPM IDFSVVSRNG 

101 VAALVGDQYI VSVAHNGGYN NVDFGAEGRN PDQHRFTYKI VKRNNYKAGT 

151 KGHPYGGDYH MPRLHKFVTD AEPVEMTSYM DGRKYIDQNN YPDRVRIGAG 

201 RQYWRSDEDE PNNRESSYHI ASAYSWLVGG NTFAQNGSGG GTVNLGSEKI 

251 KHSPYGFLPT GGSFGDSGSP MFIYDAQKQK WLINGVLQTG NPYIGKSNGF 

301 QLVRKDWFYD EIFAGDTHSV FYEPRQNGKY SFNDDNNGTG KINAKHEHNS 

351 LPNRLKTRTV QLFNVSLSET ARE PV YHAAG GVNSYRPRLN NGENISFIDE 

401 GKGELILTSN INQGAGGLYF QGDFTVSPEN NETWQGAGVH ISEDSTVTWK 

451 VNGVANDRLS KIGKGTLHVQ AKGENQGSIS VGDGTVILDQ QADDKGKKQA 

501 FSEIGLVSGR GTVQLNADNQ FNPDKLYFGF RGGRLDLNGH SLSFHRIQNT 

551 DEGAMIVNHN QDKESTVTIT GNKDIATTGN NNSLDSKKEI AYNGWFGEKD 

601 TTKTNGRLNL VYQPAAEDRT LLLSGGTNLN GNITQTNGKL FFSGRPTPHA 

651 YNHLNDHWSQ KEGIPRGEIV WDNDWINRTF KAENFQIKGG QAVVSRNVAK 

701 VKGDWHLSNH AQAVFGVAPH QSHTICTRSD WTGLTNCVEK TITDDKVIAS 

751 LTKTDISGNV DLADHAHLNL TGLATLNGNL SANGDTRYTV SHNATQNGNL 

801 SLVGNAQATF NQATLNGNTS ASGNASFNLS DHAVQNGSLT LSGNAKANVS 

851 HSALNGNVSL ADKAVFHFES SRFTGQISGG KDTALHLKDS EWTLPSGTEL 

901 GNLNLDNATI TLNSAYRHDA AGAQTGSATD APRRRSRRSR RSLLSVTPPT 

951 SVESRFNTLT VNGKLNGQGT FRFMSELFGY RSDKLKLAES SEGTYTLAVN 

1001 NTGNEPASLE QLTVVEGKDN KPLSENLNFT LQNEHVDAGA WRYQLIRKDG 
1051 EFRLHNPVKE QELSDKLGKA EAKKQAEKDN AQSLDALIAA GRDAVEKTES 

1101 VAEPARQAGG ENVGIMQAEE EKKRVQADKD TALAKQREAE TRPATTAFPR 

1151 ARRARRDLPQ LQPQPQPQPQ RDLISRYANS GLSEFSATLN SVFAVQDELD 

1201 RVFAEDRRNA VWTSGIRDTK HYRSQDFRAY RQQTDLRQIG MQKNLGSGRV 

1251 GILFSHNRTE NTFDDGIGNS ARLAHGAVFG QYGIDRFYIG ISAGAGFSSG 

1301 SLSDGIGGKI RRRVLHYGIQ ARYRAGFGGF GIEPHIGATR YFVQKADYRY 

1351 ENVNIATPGL AFNRYRAGIK ADYSFKPAQH ISITPYLSLS YTDAASGKVR 

1401 TRVNTAVLAQ DFGKTRSAEW GVNAEIKGFT LSLHAAAAKG PQLEAQHSAG 
1451 IKLGYRW* 

The leader peptide is underlined. 

[0148] A polymorphic form of ORF1 is disclosed in W099/55873. 

[0149] Three expression strategies have been used for ORF1 : 



1 ) ORF1 using a His tag, following W099/24578 (ORF1 -His); 

2) ORF1 with its own leader peptide but without any fusion partner ('ORF1 U); and 

3) ORF1 with the leader peptide (M KKTAI Al AVALAG FATVAQ A) from E.coli OmpA ('Orf1 LOmpA'): 
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MKKTAIAIAVALAGFATVAQA ASAGHTYFGINYQYYRDFAENKGKFAVGAKDIEVYNKKGELVGKSMTKAPMIDFSV 
VSRNGVAALVGDQYIVSVAHNGGYNNVDFGAEGRNPDQHRFTYKIVKRNNYKAGTKGHPYGGDYHMPRLHKFVTDAE 
PVEMTSYMDGRKYIDQNNYPDRVRIGAGRQYWRSDEDEPNNRESSYHIASAYSWLVGGNTFAQNGSGGGTVNLGSEK 
IKHSPYGFLPTGGSFGDSGSPMFIYDAQKQKWLINGVLQTGN PYIGKSNGFQLVRKDWFYDEIFAGDTHSVFYEPRQ 
NGKYSFNDDNNGTGKINAKHEHNSLPNRLKTRTVQLFNVSLSETAREPVYHAAGGVNSYRPRLNNGENISFIDEGKG 
ELILTSNINQGAGGLYFQGDFTVSPENNETWQGAGVHISEDSTVTWKVNGVANDRLSKIGKGTLHVQAKGENQGSIS 
VGDGTVILDQQADDKGKKQAFSEIGLVSGRGTVQLNADNQFNPDKLYFGFRGGRLDLNGHSLSFHRIQNTDEGAMIV 
NHNQDKESTVTITGNKDIATTGNNNSLDSKKEIAYNGW FGEKDTTKTNGRLNLVYQPAAEDRTLLLSGGTNLNGNIT 
QTNGKLFFSGRPTPHAYNHLNDHWSQKEGIPRGEIVWDN DWINRTFKAENFQIKGGQAWSRNVAKVKGDWHLSNHA 
QAVFGVAPHQSHTICTRSDWTGLTNCVEKTITDDKVIASLTKTDISGNVDLADHAHLNLTGLATLNGNLSANGDTRY 
TVSHNATQNGNLSLVGNAQATFNQATLNGNTSASGNASFNLSDHAVQNGSLTLSGNAKANVSHSALNGNVSLADKAV 
FHFESSRFTGQISGGKDTALHLKDSEWTLPSGTELGNLNLDNATITLNSAYRHDAAGAQTGSATDAPRRRSRRSRRS 
LLSVTPPTSVESRFNTLTVNGKLNGQGTFRFMSELFGYRSDKLKLAESSEGTYTLAVNNTGNEPASLEQLTVVEGKD 
NKPLSENLN FTLQNEHVDAGAWRYQLIRKDGEFRLHNPVKEQELSDKLGKAEAKKQAEKDNAQSLDALIAAGRDAVE 
KTESVAEPARQAGGENVGIMQAEEEKKRVQADKDTALAKQREAETRPATTAFPRARRARRDLPQLQPQPQPQPQRDL 
ISRYANSGLSEFSATLNSVFAVQDELDRVFAEDRRNAVWTSGIRDTKHYRSQDFRAYRQQTDLRQIGMQKNLGSGRV 
GILFSHNRTENTFDDGIGNSARLAHGAVFGQYGIDRFYIGISAGAGFSSGSLSDGIGGKIRRRVLHYGIQARYRAGF 
GGFGIEPHIGATRYFVQKADYRYENVNIATPGLAFNRYRAGIKADYSFKPAQHISITPYLSLSYTDAASGKVRTRVN 
TAVLAQDFGKTRS AEWGVNAE I KG FTL S LHAAAAKG PQLEAQH SAG I KLG YRW* 



To make this construct, the clone pET91 1 LOmpA (see below) was digested with the Nhe\ and Xho\ restriction 
enzymes and the fragment corresponding to the vector carrying the OmpA leader sequence was purified (pETLOm- 
pA). The ORF1 gene coding for the mature protein was amplified using the oligonucleotides ORF1 -For and ORF1 - 
Rev (including the Nhe\ and Xho\ restriction sites, respectively), digested with Nhe\ and Xho\ and ligated to the 
purified pETOmpA fragment (see Figure 1). An additional AS dipeptide was introduced by the Nhe\ site. 

[0150] All three forms of the protein were expressed. The His-tagged protein could be purified and was confirmed as 
surface exposed, and possibly secreted (see Figure 3). The protein was used to immunise mice, and the resulting sera 
gave excellent results in the bactericidal assay. 

[0151] ORFILOmpA was purified as total membranes, and was localised in both the inner and outer membranes. 
Unexpectedly, sera raised against ORFILOmpA show even better ELISA and anti-bactericidal properties than those 
raised against the His-tagged protein. 

[0152] ORF1 L was purified as outer membranes, where it is localised. 
Example 7 - protein 911 and its leader peptide 

[0153] Protein 91 1 from N. meningitidis (serogroup B, strain MC58) has the following sequence: 

1 MKKNILE FWV GLFVLIGAAA VAFL AFRVAG GAAFGGSDKT YAVYADFGDI 

51 GGLKVNAPVK SAGVLVGRVG AIGLDPKSYQ ARVRLDLDGK YQFSSDVSAQ 

101 ILTSGLLGEQ YIGLQQGGDT ENLAAGDTIS VTSSAMVLEN LIGKFMTSFA 

151 EKNADGGNAE KAAE* 

The leader peptide is underlined. 

[0154] Three expression strategies have been used for 91 1 : 

1) 911 with its own leader peptide but without any fusion partner ('91 1 L'); 

2) 91 1 with the leader peptide from E.co//OmpA ('91 1 LOmpA'). To make this construct, the entire sequence encoding 
the OmpA leader peptide was included in the 5'- primer as a tail (primer 91 1 LOmpA Forward). A Nhe\ restriction 
site was inserted between the sequence coding for the OmpA leader peptide and the 91 1 gene encoding the 
predicted mature protein (insertion of one amino acid, a serine), to allow the use of this construct to clone different 
genes downstream the OmpA leader peptide sequence. 

3) 91 1 with the leader peptide (MKYLLPTAAAGLLLAAQPAMA) from Erwinia carotovora PelB ('91 1 LpelB'). 

[0155] To make this construct, the 5'-end PCR primer was designed downstream from the leader sequence and 
included the Nco\ restriction site in order to have the 91 1 fused directly to the PelB leader sequence; the 3'- end primer 
included the STOP codon. The expression vector used was pET22b+ (Novagen), which carries the coding sequence 



16 
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for the PelB leader peptide. The Nco\ site introduces an additional methionine after the PelB sequence. 

[0156] All three forms of the protein were expressed. ELISA titres were highest using 911 L, with 919LOmpA also 

giving good results. 

5 Example 8 - ORF46 

[0157] The complete ORF46 protein from N. meningitidis (serogroup B, strain 2996) has the following sequence: 



10 1 LGISRKISLI LSILAVCLPM HAHA SDLAND SFIRQVLDRQ HFEPDGKYHL 

51 FGSRGELAER SGHIGLGKIQ SHQLGNLMIQ QAAIKGNIGY IVRFSDHGHE 

101 VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD GYDGPQGGGY 

151 PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH NAGSMLTQGV 

201 GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE IVGAGDAVQG 

2 51 ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA AIRDWAVQNP 

15 301 NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK RSQMGAIALP 

351 KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEQRYGKENI TSSTVPPSNG 

4 01 KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTKLDIQEL SGGGIPKAKP 

4 51 VSDAKPRWEV DRKLNKLTTR EQVEKNVQEI RNGNKNSNFS QHAQLEREIN 

501 KLKSADEINF ADGMGKFTDS MNDKAFSRLV KSVKENGFTN PVVEYVEING 

551 KAYIVRGNNR VFAAEYLGRI HELKFKKVDF PVPNTSWKNP TDVLNESGNV 

20 601 KRPRYRSK* 



The leader peptide is underlined. 

[0158] The sequences of ORF46 from other strains can be found in WO00/66741 . 
25 [0159] Three expression strategies have been used for ORF46: 

1) ORF46 with its own leader peptide but without any fusion partner ('ORF46-2L'); 

2) ORF46 without its leader peptide and without any fusion partner ('ORF46-2'), with the leader peptide omitted by 
designing the 5'-end amplification primer downstream from the predicted leader sequence: 



1 SDLANDSFIR QVLDRQHFEP DGKYHLFGSR GELAERSGHI GLGKIQSHQL 

51 GNLMIQQAAI KGNIGYIVRF SDHGHEVHSP FDNHASHSDS DEAGSPVDGF 

101 SLYRIHWDGY EHHPADGYDG PQGGGYPAPK GARDIYSYDI KGVAQNIRLN 

151 LTDNRSTGQR LADRFHNAGS MLTQGVGDGF KRATRYSPEL DRSGNAAEAF 

201 NGTADIVKNI IGAAGEIVGA GDAVQGISEG SNIAVMHGLG LLSTENKMAR 

251 INDLADMAQL KDYAAAAIRD WAVQNPNAAQ GIEAVSNIFM AAIPIKGIGA 

301 VRGKYGLGGI TAHPIKRSQM GAIALPKGKS AVSDNFADAA YAKYPSPYHS 

351 RNIRSNLEQR YGKENITSST VPPSNGKNVK LADQRHPKTG VPFDGKGFPN 

401 FEKHVKYDTK LDIQELSGGG IPKAKPVSDA KPRWEVDRKL NKLTTREQVE 

40 4 51 KNVQEIRNGN KNSNFSQHAQ LEREINKLKS ADEINFADGM GKFTDSMNDK 

501 AFSRLVKSVK ENGFTNPWE YVEINGKAYI VRGNNRVFAA EYLGRIHELK 

551 FKKVDFPVPN TSWKNPTDVL NESGNVKRPR YRSK* 



45 3) ORF46 as a truncated protein, consisting of the first 433 amino acids ('ORF46.1 L'), constructed by designing 

PCR primers to amplify a partial sequence corresponding to aa 1-433. 
A STOP codon was included in the 3'-end primer sequences. 

[0160] ORF46-2L is expressed at a very low level to E. coii. Removal of its leader peptide (ORF46-2) does not solve 
50 this problem. The truncated ORF46.1 L form (first 433 amino acids, which are well conserved between serogroups and 
species), however, is well-expressed and gives excellent results in ELISA test and in the bactericidal assay. 
[0161] ORF46.1 has also been used as the basis of hybrid proteins. It has been fused with 287, 919, and ORF1 . The 
hybrid proteins were generally insoluble, but gave some good ELISA and bactericidal results (against the homologous 
2996 strain): 

55 



Protein 


ELISA 


Bactericidal Ab 


Orf1-Orf46.1-His 


850 


256 
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(continued) 



5 



Protein 


ELISA 


Bactericidal Ab 


919-Orf46.1-His 


12900 


512 


919-287-Orf46-His 


n.d. 


n.d. 


Orf46.1-287His 


150 


8192 


Orf46.1-919His 


2800 


2048 


Orf46.1 -287-91 9His 


3200 


16384 



[0162] For comparison, 'triple' hybrids of ORF46.1, 287 (either as a GST fusion, or in AG287 form) and 919 were 
constructed and tested against various strains (including the homologous 2996 strain) versus a simple mixture of the 
three antigens. FCA was used as adjuvant: 



20 





2996 


BZ232 


MC58 


NGH38 


F6124 


BZ133 


Mixture 


8192 


256 


512 


1024 


>2048 


>2048 


ORF46.1 -287-91 9his 


16384 


256 


4096 


8192 


8192 


8192 


AG287-919-ORF46.1his 


8192 


64 


4096 


8192 


8192 


1 6384 


AG287-ORF46.1-919his 


4096 


128 


256 


8192 


512 


1024 



25 Again, the hybrids show equivalent or superior immunological activity. 

[0163] Hybrids of two proteins (strain 2996) were compared to the individual proteins against various heterologous 
strains: 





1000 


MC58 


F6124 (MenA) 


ORF46.1-His 


<4 


4096 


<4 


ORF1-His 


8 


256 


128 


ORF1— ORF46.1-His 


1024 


512 


1024 



35 

[0164] Again, the hybrid shows equivalent or superior immunological activity. 
Example 9 - protein 961 

40 [0165] The complete 961 protein from N. meningitidis (serogroup B, strain MC58) has the following sequence: 



1 MSMKHFPAKV LTTAILATFC SGALA ATSDD DVKKAATVAI VAAYNNGQEI 

51 NGFKAGETIY DIGEDGTITQ KDATAADVEA DDFKGLGLKK WTNLTKTVN 

45 101 ENKQNVDAKV KAAESEIEKL TTKLADTDAA LADTDAALDE TTNALNKLGE 

151 NITTFAEETK TNIVKIDEKL EAVADTVDKH AEAFNDIADS LDETNTKADE 

201 AVKTANEAKQ TAEETKQNVD AKVKAAETAA GKAEAAAGTA NTAADKAEAV 

2 51 AAKVTDIKAD IATNKADIAK NSARIDSLDK NVANLRKETR QGLAEQAALS 

301 GLFQPYNVGR FNVTAAVGGY KSESAVAIGT GFRFTENFAA KAGVAVGTSS 

351 GSSAAYHVGV NYEW* 

50 



[0166] The leader peptide is underlined. 

[0167] Three approaches to 961 expression were used: 

1 ) 961 using a GST fusion, following WO99/57280 ('GST961 '); 

2) 961 with its own leader peptide but without any fusion partner ('961 U); and 

3) 961 without its leader peptide and without any fusion partner ('961 untagged,^ with the leader peptide omitted by 
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designing the 5' -end PCR primer downstream from the predicted leader sequence. 

[0168] All three forms of the protein were expressed. The GST-fusion protein could be purified and antibodies against 
it confirmed that 961 is surface exposed (Figure 4). The protein was used to immunise mice, and the resulting sera gave 
5 excellent results in the bactericidal assay. 961 L could also be purified and gave very high ELISA titres. 

[0169] Protein 961 appears to be phase variable. Furthermore, it is not found in all strains of N. meningitidis. 

Example 10- protein 287 

10 [0170] Protein 287 from N. meningitidis (serogroup B, strain 2996) has the following sequence: 



1 MFERSVIAMA CIFALSA CGG 

51 EDAPQAGSQG QGAPSTQGSQ 

101 DMPQNSAESA NQTGNNQPAD 

151 DGPSQNITLT HCKGDSCNGD 



GGGGSPDVKS ADTLSKPAAP VVAEKETEVK 
DMAAVSAENT GNGGAATTDK PKNEDEGPQN 
SSDSAPASNP APANGGSNFG RVDLANGVLI 
NLLDEEAPSK SEFENLNESE RIEKYKKDGK 



201 SDKFTNLVAT AVQANGTNKY 

251 PLIPVNQADT LIVDGEAVSL 

301 LRVQGEPAKG EMLAGTAVYN 

351 VDGIIDSGDD LHMGTQKFKA 

4 01 VAGKYSYRPT DAEKGGFGVF 



VIIYKDKSAS SSSARFRRSA RSRRSLPAEM 

TGHSGNIFAP EGNYRYLTYG AEKLPGGSYA 

GEVLHFHTEN GRPYPTRGRF AAKVDFGSKS 

AIDGNGFKGT WTENGGGDVS GRFYGPAGEE 
AGKKEQD* 



25 

[0171] The leader peptide is shown underlined. 

[0172] The sequences of 287 from other strains can be found in Figures 5 and 15 of WO00/66741 . 
[0173] Example 9 of WO99/57280 discloses the expression of 287 as a GST-fusion in E.coii. 
[0174] A number of further approaches to expressing 287 in E.coli have been used, including: 

30 

1) 287 as a His-tagged fusion ('287-His'); 

2) 287 with its own leader peptide but without any fusion partner ('287L'); 

3) 287 with the ORF4 leader peptide and without any fusion partner ('287LOrf4'); and 

4) 287 without its leader peptide and without any fusion partner ('287 unta g9 ed '): 



1 CGGGGGGSPD VKSADTLSKP AAPVVAEKET EVKEDAPQAG SQGQGAPSTQ 

51 GSQDMAAVSA ENTGNGGAAT TDKPKNEDEG PQNDMPQNSA ESANQTGNNQ 

101 PADSSDSAPA SNPAPANGGS NFGRVDLANG VLIDGPSQNI TLTHCKGDSC 

40 151 NGDNLLDEEA PSKSEFENLN ESERIEKYKK DGKSDKFTNL VATAVQANGT 

201 NKYVIIYKDK SASSSSARFR RSARSRRSLP AEMPLIPVNQ ADTLIVDGEA 

251 VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP AKGEMLAGTA 

301 VYNGEVLHFH TENGRPYPTR GRFAAKVDFG SKSVDGIIDS GDDLHMGTQK 

351 FKAAIDGNGF KGTWTENGGG DVSGRFYGPA GEEVAGKYSY RPTDAEKGGF 

401 GVFAGKKEQD * 

45 



[0175] All these proteins could be expressed and purified. 
[0176] '287L' and '287LOrf4' were confirmed as lipoproteins. 

[0177] As shown in Figure 2, '287LOrf4' was constructed by digesting 91 9LOrf4 with Nhe\ and Xho\. The entire ORF4 
50 leader peptide was restored by the addition of a DNA sequence coding for the missing amino acids, as a tail, in the 5'- 
end primer (287LOrf4 for), fused to 287 coding sequence. The 287 gene coding for the mature protein was amplified 
using the oligonucleotides 287LOrf4 For and Rev (including the Nhe\ and Xho\ sites, respectively), digested with Nhe\ 
and Xho\ and ligated to the purified pETOrf4 fragment. 

55 Example 11 - further non- fusion proteins with/ without native leader peptides 

[0178] A similar approach was adopted for E.coii expression of further proteins from W099/24578, W099/36544 and 
WO99/57280. 
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[0179] The following were expressed without a fusion partner: 008, 105, 117-1, 121-1 , 122-1 , 128-1 , 148, 216, 243, 
308, 593, 652, 726, 982, and Orf1 43-1 . Protein 117-1 was confirmed as surface-exposed by FACS and gave high ELISA 
titres. 

[0180] The following were expressed with the native leader peptide but without a fusion partner: 1 1 1 , 149, 206, 225-1 , 
5 235, 247-1 , 274, 283, 286, 292, 401 , 406, 502-1 , 503, 51 9-1 , 525-1 , 552, 556, 557, 570, 576-1 , 580, 583, 664, 759, 
907, 913, 920-1 , 926, 936-1 , 953, 961 , 983, 989, Orf4, Orf7-1 , Orf9-1 , Orf23, Orf25, Orf37, Orf38, Orf40, Orf40.1 , Orf40.2, 
Orf72-1 , Orf76-1 , Orf85-2, Orf91 , Orf97-1 , Orf1 19, Orf143.1 . These proteins are given the suffix 'L\ 
[0181] His-tagged protein 760 was expressed with and without its leader peptide. The deletion of the signal peptide 
greatly increased expression levels. The protein could be purified most easily using 2M urea for solubilisation. 
10 [0182] His-tagged protein 264 was well-expressed using its own signal peptide, and the 30kDa protein gave positive 
Western blot results. 

[0183] All proteins were successfully expressed. 

[0184] The localisation of 593, 121-1 , 128-1 , 593, 726, and 982 in the cytoplasm was confirmed. 
[0185] The localisation of 920-1 L, 953L, ORF9-1 L, ORF85-2L, ORF97-1 L, 570L, 580L and 664L in the periplasm was 
15 confirmed. 

[0186] The localisation of ORF40L in the outer membrane, and 008 and 519-1 L in the inner membrane was confirmed. 
ORF25L, ORF4L, 406L, 576-1 L were all confirmed as being localised in the membrane. 
[0187] Protein 206 was found not to be a lipoprotein. 

[0188] ORF25 and ORF40 expressed with their native leader peptides but without fusion partners, and protein 593 
20 expressed without its native leader peptide and without a fusion partner, raised good anti-bactericidal sera. Surprisingly, 
the forms of ORF25 and ORF40 expressed without fusion partners and using their own leader peptides (i.e. 'ORF25L' 
and 'ORF40L') give better results in the bactericidal assay than the fusion proteins. 

[0189] Proteins 920L and 953L were subjected to N-terminal sequencing, giving HRVWVETAH and ATYKVDEY- 
HANARFAF, respectively. This sequencing confirms that the predicted leader peptides were cleaved and, when combined 
25 with the periplasmic location, confirms that the proteins are correctly processed and localised by E.co//when expressed 
from their native leader peptides. 

[0190] The N-terminal sequence of protein 51 9.1 L localised in the inner membrane was MEFFIILLA, indicating that 
the leader sequence is notcleaved. It may therefore function as both an uncleaved leadersequence and atransmembrane 
anchor in a manner similar to the leader peptide of PBP1 from N. gonorrhoeae [Ropp & Nicholas (1997) J. Bact. 179: 
30 2783-2787.]. Indeed the N-terminal region exhibits strong hydrophobic character and is predicted by the Tmpred. program 
to be transmembrane. 

Example 12 - lipoproteins 

35 [0191] The incorporation of palmitate in recombinant lipoproteins was demonstrated by the method of Kraft et. al. [J. 
Bact. (1 998) 1 80:3441 -3447.]. Single colonies harbouring the plasmid of interest were grown overnight at 37°C in 20 ml 
of LB/Amp (100[xg/ml) liquid culture. The culture was diluted to an OD 550 of 0.1 in 5.0 ml of fresh medium LB/Amp 
medium containing 5 ^xC/ml [ 3 H] palmitate (Amersham). When the OD 550 of the culture reached 0.4-0.8, recombinant 
lipoprotein was induced for 1 hour with IPTG (final concentration 1 .0 mM). Bacteria were harvested by centrifugation in 

40 a bench top centrifuge at 2700g for 15 min and washed twice with 1 .0 ml cold PBS. Cells were resuspended in 120^x1 
of 20 mM Tris-HCI (pH 8.0), 1 mM EDTA, 1 .0% w/v SDS and lysed by boiling for 10 min. After centrifugation at 13000g 
for 10 min the supernatant was collected and proteins precipitated by the addition of 1.2 ml cold acetone and left for 1 
hour at -20 °C. Protein was pelleted by centrifugation at 13000g for 10 min and resuspended in 20-50|xl (calculated to 
standardise loading with respect to the final O.D of the culture) of 1 .0% w/v SDS. An aliquot of 15 |xl was boiled with 

45 5jxl of SDS-PAGE sample buffer and analysed by SDS-PAGE. After electrophoresis gels were fixed for 1 hour in 10% 
v/v acetic acid and soaked for 30 minutes in Amplify solution (Amersham). The gel was vacuum-dried under heat and 
exposed to Hyperfilm (Kodak) overnight -80 °C. 

[0192] Incorporation of the [ 3 H] palmitate label, confirming lipidation, was found for the following proteins: Orf4L, 
Orf25L, 287L, 287LOrf4, 406. L, 576L, 926L, 91 9L and 919LOrf4. 

50 

Example 13- domains in 287 

[0193] Based on homology of different regions of 287 to proteins that belong to different functional classes, it was 
split into three 'domains', as shown in Figure 5. The second domain shows homology to IgA proteases, and the third 
55 domain shows homology to transferrin-binding proteins. 

[0194] Each of the three 'domains' shows a different degree of sequence conservation between N. meningitidis strains 
- domain C is 98% identical, domain A is 83% identical, whilst domain B is only 71% identical. Note that protein 287 in 
strain MC58 is 61 amino acids longer than that of strain 2996. An alignment of the two sequences is shown in Figure 
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7, and alignments for various strains are disclosed in WO00/66741 (see Figures 5 and 15 therein). 

[0195] The three domains were expressed individually as C-terminal His-tagged proteins. This was done for the MC58 

and 2996 strains, using the following constructs: 

287a-MC58 (aa 1-202), 287b-MC58 (aa 203-288), 287c-MC58 (aa 31 1-488). 

5 287a-2996 (aa 1-139), 287b-2996 (aa 140-225), 287C-2996 (aa 250-427). 

[0196] To make these constructs, the stop codon sequence was omitted in the3'-end primer sequence. The 5' primers 
included the Nhe\ restriction site, and the 3' primers included a Xho\ as a tail, in order to direct the cloning of each 
amplified fragment into the expression vector pET21 b-i- using Nde\-Xho\, Nhe\-Xho\ or Nde\-Hind\\\ restriction sites. 
[0197] All six constructs could be expressed, but 287b-MC8 required denaturation and refolding for solubilisation. 

10 [0198] Deletion of domain A is described below ('A4 287-His'). 

[0199] Immunological data (serum bactericidal assay) were also obtained using the various domains from strain 2996, 
against the homologous and heterologous MenB strains, as well as MenA (F6124 strain) and MenC (BZ133 strain): 





2996 


BZ232 


MC58 


NGH38 


394/98 


MenA 


MenC 


287-His 


32000 


16 


4096 


4096 


512 


8000 


16000 


287(B)-His 


256 










16 




287(C)-His 


256 




32 


512 


32 


2048 


>2048 


287(B-C)-His 


64000 


128 


4096 


64000 


1024 


64000 


32000 



[0200] Using the domains of strain MC58, the following results were obtained: 





MC58 


2996 


BZ232 


NGH38 


394/98 


MenA 


MenC 


287-His 


4096 


32000 


16 


4096 


512 


8000 


16000 


287(B)-His 


128 


128 










128 


287(C)-His 




16 




1024 




512 




287(B-C)-His 


16000 


64000 


128 


64000 


512 


64000 


>8000 



Example 14 — deletions in 287 



[0201] As well as expressing individual domains, 287 was also expressed (as a C-terminal His-tagged protein) by 
making progressive deletions within the first domain. These 

[0202] Four deletion mutants of protein 287 from strain 2996 were used (Figure 6): 



1) '287-His', consisting of amino acids 18-427 (i.e. leader peptide deleted); 

2) 'A1 287-His', consisting of amino acids 26-427; 

3) 'A2 287-His', consisting of amino acids 70-427; 

4) 'A3 287-His', consisting of amino acids 107-427; and 

5) 'A4 287-His', consisting of amino acids 140-427 (=287-bc). 



[0203] The 'A4' protein was also made for strain MC58 ('A4 287MC58-His'; aa 203-488). 

[0204] The constructs were made in the same way as 287a/b/c, as described above. 

[0205] All six constructs could be expressed and protein could be purified. Expression of 287-His was, however, quite 
poor. 

[0206] Expression was also high when the C-terminal His-tags were omitted. 

[0207] Immunological data (serum bactericidal assay) were also obtained using the deletion mutants, against the 
homologous (2996) and heterologous MenB strains, as well as MenA (F6124 strain) and MenC (BZ133 strain): 



55 





2996 


BZ232 


MC58 


NGH38 


394/98 


MenA 


MenC 


287-his 


32000 


16 


4096 


4096 


512 


8000 


16000 


A1 287-His 


16000 


128 


4096 


4096 


1024 


8000 


16000 


A2 287-His 


16000 


128 


4096 


>2048 


512 


16000 


>8000 
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(continued) 





2996 


BZ232 


MC58 


NGH38 


394/98 


MenA 


MenC 


A3 287-His 


16000 


128 


4096 


>2048 


512 


16000 


>8000 


A4 287-His 


64000 


128 


4096 


64000 


1024 


64000 


32000 



[0208] The same high activity for the A4 deletion was seen using the sequence from strain MC58. 

[0209] As well as showing superior expression characteristics, therefore, the mutants are immunologically equivalent 

or superior. 

Example 15 - poly-glycine deletions 

[0210] The 'A1 287-His' construct of the previous example differs from 287-His and from '287 unta 9g ed ' only by a short 
N-terminal deletion (GGGGGGS). Using an expression vector which replaces the deleted serine with a codon present 
in the Nhe cloning site, however, this amounts to a deletion only of (Gly) 6 . Thus, the deletion of this (Gly) 6 sequence 
has been shown to have a dramatic effect on protein expression. 

[021 1] The protein lacking the N-terminal amino acids up to GGGGGG is called 'AG 287'. In strain MC58, its sequence 
(leader peptide underlined) is: 

( AG287 

1 MFKRSVIAMA CIFALSA CGG GGGGSPDVKS ADTLSKPAAP VVSEKETEAK 
51 EDAPQAGSQG QGAPSAQGSQ DMAAVSEENT GNGGAVTADN PKNEDEVAQN 
101 DMPQNAAGTD SSTPNHTPDP NMLAGNMENQ ATDAGESSQP ANQPDMANAA 
151 DGMQGDDPSA GGQNAGNTAA QGANQAGNNQ AAGSSDPIPA SNPAPANGGS 
201 NFGRVDLANG VLIDGPSQNI TLTHCKGDSC SGNNFLDEEV QLKSEFEKLS 
251 DADKISNYKK DGKNDKFVGL VADSVQMKGI NQYIIFYKPK PTSFARFRRS 
301 ARSRRSLPAE MPLIPVNQAD TLIVDGEAVS LTGHSGNIFA PEGNYRYLTY 
351 GAEKLPGGSY ALRVQGEPAK GEMLAGAAVY NGEVLHFHTE NGRPYPTRGR 
401 FAAKVDFGSK SVDGIIDSGD DLHMGTQKFK AAIDGNGFKG TWTENGSGDV 
4 51 SGKFYGPAGE EVAGKYSYRP TDAEKGGFGV FAGKKEQD* 

[0212] AG287, with or without His-tag ('AG287-His' and 'AG287K', respectively), are expressed at very good levels 
in comparison with the '287-His' or '287 untagged' 

[0213] On the basis of gene variability data, variants of AG287-His were expressed in E.coli from a number of MenB 
strains, in particular from strains 2996, MC58, 1000, and BZ232. The results were also good. 

[0214] It was hypothesised that poly-Gly deletion might be a general strategy to improve expression. Other MenB 
lipoproteins containing similar (Gly) n motifs (near the N-terminus, downstream of a cysteine) were therefore identified, 
namely Tbp2 (NMB0460), 741 (NMB 1870) and 983 (NMB1969): 



TBP2 ( AGTbp2 

1 MNNPLVNQAA MVLPVFLLSA CLGGGGSFDL DSVDTEAPRP APKYQDVFSE 

51 KPQAQKDQGG YGFAMRLKRR NWYPQAKEDE VKLDESDWEA TGLPDEPKEL 

101 PKRQKSVIEK VETDSDNNIY SSPYLKPSNH QNGNTGNGIN QPKNQAKDYE 

151 NFKYVYSGWF YKHAKREFNL KVEPKSAKNG DDGYIFYHGK EPSRQLPASG 

201 KITYKGVWHF ATDTKKGQKF REIIQPSKSQ GDRYSGFSGD DGEEYSNKNK 

251 STLTDGQEGY GFTSNLEVDF HNKKLTGKLI RNNANTDNNQ ATTTQYYSLE 

301 AQVTGNRFNG KATATDKPQQ NSETKEHPFV SDSSSLSGGF FGPQGEELGF 

351 RFLSDDQKVA VVGSAKTKDK PANGNTAAAS GGTDAAASNG AAGTSSENGK 

4 01 LTTVLDAVEL KLGDKEVQKL DNFSNAAQLV VDGIMIPLLP EASESGNNQA 
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4 51 NQGTNGGTAF TRKFDHTPES DKKDAQAGTQ TNGAQTASNT AGDTNGKTKT 

501 YEVEVCCSNL NYLKYGMLTR KNSKSAMQAG ESSSQADAKT EQVEQSMFLQ 

551 GERTDEKEIP SEQNIVYRGS WYGYIANDKS TSWSGNASNA TSGNRAEFTV 

601 NFADKKITGT LTADNRQEAT FTIDGNIKDN GFEGTAKTAE SGFDLDQSNT 

651 TRTPKAYITD AKVQGGFYGP KAEELGGWFA YPGDKQTKNA TNASGNSSAT 

701 VVFGAKRQQP VR* 



741 f AG741 

1 VNRTAFCCLS LTTALILTA C SSGGGGVAAD IGAGLADALT APLDHKDKGL 

51 QSLTLDQSVR KNEKLKLAAQ GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ 

101 IEVDGQLITL ESGEFQVYKQ SHSALTAFQT EQIQDSEHSG KMVAKRQFRI 

151 GDIAGEHTSF DKLPEGGRAT YRGTAFGSDD AGGKLTYTID FAAKQGNGKI 

201 EHLKSPELNV DLAAADIKPD GKRHAVISGS VLYNQAEKGS YSLGIFGGKA 

251 QEVAGSAEVK TVNGIRHIGL AAKQ* 
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25 



30 



35 



983 

1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 



MRTTPTFPTK TFKPTAMALA VATTLSACLG 



NSRATTAKSA 
LHTGDFPNPN 
LYGRKEHGYN 
DIRHVKEIGH 
MVAAIRNAWV 
DYSGGDKTDE 
LLPFYEKDAQ 
MWCLSAPYEA 
LRTTLLTTAQ 
SDIAYSFRND 
KSDMRVETKG 
LDGKGTLYTR 
AAKIGQDYSF 
ASAAAHSAPA 
DMPGIRPYGA 
RRLKAVSDGL 
KTGENTTAAA 
FSYGRYKNSI 
GGLRYDLLKQ 
FAT AG VE R D L 
FGNGWNGLAR 



AVSYAGIKNE 
DAYKNLINLK 
ENYKNYTAYM 
IDLVSHIIGG 
KLGERGVRIV 
GIRLMQQSDY 
KGIITVAGVD 
SVRFTRTNPI 
DIGAVGVDSK 
ISGTGGLIKK 
ALIYNGAASG 
LGKLLKVDGT 
FTNIETDGGL 
GLKHAVEQGG 
TFRAAAAVQH 
DHNGTGLRVI 
TLGMGRSTWS 
SRSTGADEHA 
DAFAEKGSAL 
NGRDYTVTGG 
YSYAGSKQYG 



MCKDRSMLCA 
PAIEAGYTGR 
RKEAPEDGGG 
RSVDGRPAGG 
NNSFGTTSRA 
GNLSYHIRNK 
RSGEKFKREM 
QIAGTSFSAP 
FGWGLLDAGK 
GGSQLQLHGN 
GSLNSDGIVY 
AIIGGKLYMS 
LASLDSVEKT 
SNLENLMVEL 
ANAADGVRIF 
AQTQQDGGTW 
ENSANAKTDS 
EGSVNGTLMQ 
GWSGNSLTEG 
FTGATAATGK 
NHSGRVGVGY 



( AG 9 
GGGGGTSAPD 
GRDDVAVTDR 
GVEVGIVDTG 
KDIEASFDDE 
IAPDATLHIM 
GTADLFQIAN 
NMLFIFSTGN 
YGEPGTEPLE 
IVTGTAALLL 
AMNGPASFPF 
NTYTGKTIIE 
LADTDQSGAN 
ARGKGAGYLN 
AGSEGDTLSY 
DASESSATPE 
NSLAATVYAD 
EQGGVEGKMR 
ISLFAGIRHD 
LGALGGVNVP 
TLVGLAGLKL 
TGARNMPHTR 
RF* 



83 

FNAGGTGIGS 
DAKINAPPPN 
ESVGSISFPE 
AVIETEAKPT 
NTNDETKNEM 
SEEQYRQALL 
DAQAQPNTYA 
YGSNHCGITA 
QKYPWMSNDN 
GDFTADTKGT 
GGSLVLYGNN 
ETVHIKGSLQ 
STGRRVPFLS 
YVRRGNAART 
TVETAAADRT 
STAAHADMQG 
GSTQTVGIAA 
AGDIGYLKGL 
FAATGDLTVE 
SQPLSDKAVL 
LVAGLGADVE 



40 

[0215] Tbp2 and 741 genes were from strain MC58; 983 and 287 genes were from strain 2996. These were cloned 
in pET vector and expressed in E.coli without the sequence coding for their leader peptides or as "AG forms", both fused 
to a C-terminal His-tag. In each case, the same effect was seen - expression was good in the clones carrying the deletion 
of the poly-glycine stretch, and poor or absent if the glycines were present in the expressed protein: 

45 



ORF 


Express. 


Purification 


Bact. Activity 


287-His(2996) 


+/- 


+ 


+ 


'287 unta gg ed '(2996) 


+/- 


nd 


nd 


AG287-His(2996) 


+ 


+ 


+ 


AG287K(2996) 


+ 


+ 


+ 


AG287-His(MC58) 


+ 


+ 


+ 


AG287-His(1000) 


+ 


+ 


+ 


AG287-His(BZ232) 


+ 


+ 


+ 


Tbp2-His(MC58) 


+/- 


nd 


nd 


AGTbp2-His(MC58) 


+ 


+ 




741-His(MC58) 


+/- 


nd 


nd 
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(continued) 



ORF 


Express. 


Purification Bact. Activity 


AG741-His(MC58) 


+ 


+ 


983-His (2996) 






AG983-His (2996) 


+ 


+ 



[0216] SDS-PAGE of the proteins is shown in Figure 13. 
AG 287 and hybrids 

[0217] AG287 proteins were made and purified for strains MC58, 1000 and BZ232. Each of these gave high ELISA 
titres and also serum bactericidal titres of>81 92. AG287K, expressed from pET-24b, gave excellent titres in ELISA and 
the serum bactericidal assay. AG287-ORF46.1 K may also be expressed in pET-24b. 

[0218] AG287 was also fused directly in-frame upstream of 91 9, 953, 961 (sequences shown below) and ORF46.1 : 



AG287-919 

1 ATGGCTAGCC CCGATGTTAA ATCGGCGGAC ACGCTGTCAA AACCGGCCGC 

51 TCCTGTTGTT GCTGAAAAAG AGACAGAGGT AAAAGAAGAT GCGCCACAGG 

101 CAGGTTCTCA AGGACAGGGC GCGCCATCCA CACAAGGCAG CCAAGATATG 

151 GCGGCAGTTT CGGCAGAAAA TACAGGCAAT GGCGGTGCGG CAACAACGGA 

201 CAAACCCAAA AATGAAGACG AGGGACCGCA AAATGATATG CCGCAAAATT 

251 CCGCCGAATC CGCAAATCAA ACAGGGAACA ACCAACCCGC CGATTCTTCA 

301 GATTCCGCCC CCGCGTCAAA CCCTGCACCT GCGAATGGCG GTAGCAATTT 

351 TGGAAGGGTT GATTTGGCTA ATGGCGTTTT GATTGATGGG CCGTCGCAAA 

401 ATATAACGTT GACCCACTGT AAAGGCGATT CTTGTAATGG TGATAATTTA 

451 TTGGATGAAG AAGCACCGTC AAAATCAGAA TTTGAAAATT TAAATGAGTC 

501 TGAACGAATT GAGAAATATA AGAAAGATGG GAAAAGCGAT AAATTTACTA 

551 ATTTGGTTGC GACAGCAGTT CAAGCTAATG GAACTAACAA ATATGTCATC 

601 ATTTATAAAG ACAAGTCCGC TTCATCTTCA TCTGCGCGAT TCAGGCGTTC 

651 TGCACGGTCG AGGAGGTCGC TTCCTGCCGA GATGCCGCTA ATCCCCGTCA 

7 01 ATCAGGCGGA TACGCTGATT GTCGATGGGG AAGCGGTCAG CCTGACGGGG 

7 51 CATTCCGGCA ATATCTTCGC GCCCGAAGGG AATTACCGGT ATCTGACTTA 

801 CGGGGCGGAA AAATTGCCCG GCGGATCGTA TGCCCTCCGT GTGCAAGGCG 

851 AACCGGCAAA AGGCGAAATG CTTGCTGGCA CGGCCGTGTA CAACGGCGAA 

901 GTGCTGCATT TTCATACGGA AAACGGCCGT CCGTACCCGA CTAGAGGCAG 

951 GTTTGCCGCA AAAGTCGATT TCGGCAGCAA ATCTGTGGAC GG CAT TAT CG 

1001 ACAGCGGCGA TGATTTGCAT ATGGGTACGC AAAAATTCAA AGCCGCCATC 
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10 



15 



20 



25 



1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 



GATGGAAACG 
TTCCGGAAGG 
GCTATCGCCC 
AAAAAAGAGC 
CCAAACCTTT 
CGGTCGGCAT 
GTCTATACCG 
TTTCGCCAAA 
ACCGCCAAGG 
CATTCCTTTC 
GGTTGCAGGC 
CGGTGCTGAA 
TACGGTATTC 
GAGCGGAAAA 
CAATCGACAA 
ATCACCGCGC 
CCTCCCCTAC 
AAGCCCCGAT 
CACATCCAAG 
CATCGGCTAT 
ATATGGCGGA 
ATCAAAGCCT 
TCAAAACCCC 
ACGGTCCCGT 
GCAGTCGACC 
CGCCCATCCG 
ATACCGGCAG 
TACGGCGACG 
CGTCTGGCAG 
TCGAG 



GCTTTAAGGG 
TTTTACGGCC 
GACAGATGCG 
AGGATGGATC 
CCGCAACCCG 
CCCCGACCCC 
TTGTACCGCA 
AGCCTGCAAT 
CTGGCAGGAT 
AGGCAAAACA 
AACGGAAGCC 
GGGCGACGAC 
CCGACGATTT 
GCCCTTGTCC 
TACCGGCGGC 
GCACAACGGC 
CACACGCGCA 
ACTCGGTTAC 
GCTCGGGCCG 
GCCGACAAAA 
CAAAGGCTAC 
ATATGCGGCA 
AGCTATATCT 
CGGCGCACTG 
GGCACTACAT 
GTTACCCGCA 
CGCGATTAAA 
AAGCCGGCGA 
CTCCTACCCA 



GACTTGGACG 
CGGCCGGCGA 
GAAAAGGGCG 
CGGAGGAGGA 
ACACATCCGT 
GCCGGAACGA 
CCTGTCCCTG 
CCTTCCGCCT 
GTGTGCGCCC 
GTTTTTTGAA 
TTGCCGGTAC 
AGGCGGACGG 
TATCTCCGTC 
GCATCAGGCA 
ACACATACCG 
AATCAAAGGC 
ACCAAATCAA 
GCCGAAGACC 
TCTGAAAACC 
ACGAACATCC 
CTCAAGCTCG 
AAATCCGCAA 
TTTTCCGCGA 
GGCACGCCGT 
TACCTTGGGC 
AAGCCCTCAA 
GGCGCGGTGC 
ACTTGCCGGC 
ACGGTATGAA 



GAAAATGGCG 
GGAAGTGGCG 
GATTCGGCGT 
GGATGCCAAA 
CATCAACGGC 
CGGTCGGCGG 
CCCCACTGGG 
CGGCTGCGCC 
AAGCCTTTCA 
CGCTATTTCA 
GGTTACCGGC 
CACAAGCCCG 
CCCCTGCCTG 
GACGGGAAAA 
CCGACCTCTC 
AGGTTTGAAG 
CGGCGGCGCG 
CCGTCGAACT 
CCGTCCGGCA 
CTACGTTTCC 
GGCAGACCTC 
CGCCTCGCCG 
GCTTGCCGGA 
TGATGGGGGA 
GCGCCCTTAT 
CCGCCTGATT 
GCGTGGATTA 
AAACAGAAAA 
GCCCGAATAC 



GCGGGGATGT 
GGAAAATACA 
GTTTGCCGGC 
GCAAGAGCAT 
CCGGACCGGC 
CGGCGGGGCC 
CGGCGCAGGA 
AATTTGAAAA 
AACCCCCGTC 
CGCCGTGGCA 
TATTACGAGC 
CTTCCCGATT 
CCGGTTTGCG 
AACAGCGGCA 
CCGATTCCCC 
GAAGCCGCTT 
CTTGACGGCA 
TTTTTTTATG 
AATACATCCG 
ATCGGACGCT 
GATGCAGGGC 
AAGTTTTGGG 
AGCAGCAATG 
ATATGCCGGC 
TTGTCGCCAC 
ATGGCGCAGG 
TTTTTGGGGA 
CCACGGGTTA 
CGCCCGTAAC 



30 



35 



40 



45 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 



MASPDVKSAD 
AAVSAENTGN 
DSAPASNPAP 
LDEEAPSKSE 
IYKDKSASSS 
HSGNIFAPEG 
VLHFHTENGR 
DGNGFKGTWT 
KKEQDGSGGG 
VYTWPHLSL 
HSFQAKQFFE 
YGIPDDFISV 
ITARTTAIKG 
HIQGSGRLKT 
IKAYMRQNPQ 
AVDRHYITLG 
YGDEAGELAG 



TLSKPAAPW 
GGAATTDKPK 
ANGGSNFGRV 
FENLNESERI 
SARFRRSARS 
NYRYLTYGAE 
PYPTRGRFAA 
ENGGGDVSGR 
GCQSKSIQTF 
PHWAAQDFAK 
RYFT PWQVAG 
PLPAGLRSGK 
RFEGSRFLPY 
PSGKYIRIGY 
RLAEVLGQNP 
APLFVATAHP 
KQKTTGYVWQ 



AEKETEVKED 
NEDEGPQNDM 
DLANGVLIDG 
EKYKKDGKSD 
RRSLPAEMPL 
KLPGGSYALR 
KVDFGSKSVD 
FYG PAGE EVA 
PQPDTSVING 
SLQSFRLGCA 
NGSLAGTVTG 
ALVRIRQTGK 
HTRNQINGGA 
ADKNEHPYVS 
SYIFFRELAG 
VTRKALNRLI 
LLPNGMKPEY 



APQAGSQGQG 
PQNSAESANQ 
PSQNITLTHC 
KFTNLVATAV 
IPVNQADTLI 
VQGEPAKGEM 
GIIDSGDDLH 
GKYSYRPTDA 
PDRPVGIPDP 
NLKNRQGWQD 
YYEPVLKGDD 
NSGTIDNTGG 
LDGKAPILGY 
IGRYMADKGY 
SSNDGPVGAL 
MAQDTGSAIK 
RP* 



APSTQGSQDM 
TGNNQPADSS 
KGDSCNGDNL 
QANGTNKYVI 
VDGEAVSLTG 
LAGTAVYNGE 
MGTQKFKAAI 
EKGGFGVFAG 
AGTTVGGGGA 
VCAQAFQTPV 
RRTAQARFPI 
THTADLSRFP 
AEDPVELFFM 
LKLGQTSMQG 
GTPLMGEYAG 
GAVRVDYFWG 



50 



55 



25 



EP 1 790 660 A2 



AG287-953 



10 



15 



1 


ATGGCTAGCC 


CCGATGTTAA 


51 


TCCTGTTGTT 


GCTGAAAAAG 


101 


CAGGTTCTCA 


AGGACAGGGC 


151 


GCGGCAGTTT 


CGGCAGAAAA 


201 


CAAACCCAAA 


AATGAAGACG 


251 


CCGCCGAATC 


CGCAAATCAA 


301 


GATTCCGCCC 


CCGCGTCAAA 


351 


TGGAAGGGTT 


GATTTGGCTA 


401 


ATATAACGTT 


GACCCACTGT 


451 


TTGGATGAAG 


AAGCACCGTC 


501 


TGAACGAATT 


GAGAAATATA 


551 


ATTTGGTTGC 


GACAGCAGTT 


601 


ATTTATAAAG 


ACAAGTCCGC 


651 


TGCACGGTCG 


AGGAGGTCGC 


701 


ATCAGGCGGA 


TACGCTGATT 


751 


CATTCCGGCA 


ATATCTTCGC 



ATCGGCGGAC 
AGACAGAGGT 
GCGCCATCCA 
TACAGGCAAT 
AGGGACCGCA 
ACAGGGAACA 
CCCTGCACCT 
ATGGCGTTTT 
AAAGGCGATT 
AAAAT C AGAA 
AGAAAGATGG 
CAAGCTAATG 
TTCATCTTCA 
TTCCTGCCGA 
GTCGATGGGG 
GCCCGAAGGG 



ACGCTGTCAA 
AAAAGAAGAT 
CACAAGGCAG 
GGCGGTGCGG 
AAATGATATG 
ACCAACCCGC 
GCGAATGGCG 
GATTGATGGG 
CTTGTAATGG 
TTTGAAAATT 
GAAAAGCGAT 
GAACTAACAA 
TCTGCGCGAT 
GATGCCGCTA 
AAGCGGTCAG 
AATTACCGGT 



AACCGGCCGC 
GCGCCACAGG 
CCAAGATATG 
CAACAACGGA 
CCGCAAAATT 
CGATTCTTCA 
GTAGCAATTT 
CCGTCGCAAA 
TGATAATTTA 
TAAATGAGTC 
AAATTTACTA 
ATATGTCATC 
TCAGGCGTTC 
ATCCCCGTCA 
CCTGACGGGG 
ATCTGACTTA 



801 CGGGGCGGAA AAATTGCCCG GCGGATCGTA TGCCCTCCGT GTGCAAGGCG 

851 AACCGGCAAA AGGCGAAATG CTTGCTGGCA CGGCCGTGTA CAACGGCGAA 

901 GTGCTGCATT TTCATACGGA AAACGGCCGT CCGTACCCGA CTAGAGGCAG 

951 GTTTGCCGCA AAAGTCGATT TCGGCAGCAA ATCTGTGGAC GGCATTATCG 

1001 ACAGCGGCGA TGATTTGCAT ATGGGTACGC AAAAATTCAA AGCCGCCATC 

1051 GATGGAAACG GCTTTAAGGG GACTTGGACG GAAAATGGCG GCGGGGATGT 

1101 TTCCGGAAGG TTTTACGGCC CGGCCGGCGA GGAAGTGGCG GGAAAATACA 

1151 GCTATCGCCC GACAGATGCG GAAAAGGGCG GATTCGGCGT GTTTGCCGGC 

1201 AAAAAAGAGC AGGATGGATC CGGAGGAGGA GGAGCCACCT ACAAAGTGGA 

1251 CGAATATCAC GCCAACGCCC GTTTCGCCAT CGACCATTTC AACACCAGCA 

1301 CCAACGTCGG CGGTTTTTAC GGTCTGACCG GTTCCGTCGA GTTCGACCAA 

1351 GCAAAACGCG ACGGTAAAAT CGACATCACC ATCCCCGTTG CCAACCTGCA 

14 01 AAGCGGTTCG CAACACTTTA CCGACCACCT GAAATCAGCC GACATCTTCG 

14 51 ATGCCGCCCA ATATCCGGAC ATCCGCTTTG TTTCCACCAA ATTCAACTTC 

1501 AACGGCAAAA AACTGGTTTC CGTTGACGGC AACCTGACCA TGCACGGCAA 

1551 AACCGCCCCC GTCAAACTCA AAGCCGAAAA ATTCAACTGC TACCAAAGCC 

1601 CGATGGCGAA AACCGAAGTT TGCGGCGGCG ACTTCAGCAC CACCATCGAC 

1651 CGCACCAAAT GGGGCGTGGA CTACCTCGTT AACGTTGGTA TGACCAAAAG 

1701 CGTCCGCATC G AC AT C C AAA TCGAGGCAGC CAAACAATAA CTCGAG 



40 1 MASPDVKSAD TLSKPAAPW AEKETEVKED APQAGSQGQG APSTQGSQDM 

51 AAVSAENTGN GGAATTDKPK NEDEGPQNDM PQNSAESANQ TGNNQPADSS 

101 DSAPASNPAP ANGGSNFGRV DLANGVLIDG PSQNITLTHC KGDSCNGDNL 

151 LDEEAPSKSE FENLNESERI EKYKKDGKSD KFTNLVATAV QANGTNKYVI 

201 IYKDKSASSS SARFRRSARS RRSLPAEMPL IPVNQADTLI VDGEAVSLTG 

251 HSGNIFAPEG NYRYLTYGAE KLPGGSYALR VQGEPAKGEM LAGTAVYNGE 

45 301 VLHFHTENGR PYPTRGRFAA KVDFGSKSVD GIIDSGDDLH MGTQKFKAAI 

351 DGNGFKGTWT ENGGGDVSGR FYGPAGEEVA GKYSYRPTDA EKGGFGVFAG 

401 KKEQDGSGGG GATYKVDEYH ANARFAIDHF NTSTNVGGFY GLTGSVEFDQ 

451 AKRDGKIDIT IPVANLQSGS QHFTDHLKSA DIFDAAQYPD IRFVSTKFNF 

501 NGKKLVSVDG NLTMHGKTAP VKLKAEKFNC YQSPMAKTEV CGGDFSTTID 

551 RTKWGVDYLV NVGMTKSVRI DIQIEAAKQ* 
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AG287— 961 



10 



15 



20 



25 



30 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 



ATGGCTAGCC 
TCCTGTTGTT 
CAGGTTCTCA 
GCGGCAGTTT 
CAAACCCAAA 
CCGCCGAATC 
GATTCCGCCC 
TGGAAGGGTT 
ATATAACGTT 
TTGGATGAAG 
TGAACGAATT 
ATTTGGTTGC 
ATTTATAAAG 
TGCACGGTCG 
ATCAGGCGGA 
CATTCCGGCA 
CGGGGCGGAA 
AACCGGCAAA 
GTGCTGCATT 
GTTTGCCGCA 
ACAGCGGCGA 
GATGGAAACG 
TTCCGGAAGG 
GCTATCGCCC 
AAAAAAGAGC 
TGTTAAAAAA 
AAGAAATCAA 
GACGGCACAA 
CGACTTTAAA 
CCGTCAATGA 
TCTGAAATAG 
AGCAGATACT 



CCGATGTTAA 
GCTGAAAAAG 
AGGACAGGGC 
CGGCAGAAAA 
AATGAAGACG 
CGCAAATCAA 
CCGCGTCAAA 
GATTTGGCTA 
GACCCACTGT 
AAGCACCGTC 
GAGAAATATA 
GACAGCAGTT 
ACAAGT CCGC 
AGGAGGTCGC 
TACGCTGATT 
ATATCTTCGC 
AAATTGCCCG 
AGGCGAAATG 
TTCATACGGA 
AAAGTCGATT 
TGATTTGCAT 
GCTTTAAGGG 
TTTTACGGCC 
GACAGATGCG 
AGGATGGATC 
GCTGCCACTG 
CGGTTTCAAA 
TTACCAAAAA 
GGTCTGGGTC 
AAACAAACAA 
AAAAGTTAAC 
GATGCCGCTC 



ATCGGCGGAC 
AGACAGAGGT 
GCGCCATCCA 
TACAGGCAAT 
AGGGACCGCA 
ACAGGGAACA 
CCCTGCACCT 
ATGGCGTTTT 
AAAGGCGATT 
AAAATCAGAA 
AG AAAG AT G G 
CAAGCTAATG 
TTCATCTTCA 
TTCCTGCCGA 
GTCGATGGGG 
GCCCGAAGGG 
GCGGATCGTA 
CTTGCTGGCA 
AAACGGCCGT 
TCGGCAGCAA 
ATGGGTACGC 
GACTTGGACG 
CGGCCGGCGA 
GAAAAGGGCG 
CGGAGGAGGA 
TGGCCATTGC 
GCTGGAGAGA 
AGACGCAACT 
TGAAAAAAGT 
AACGTCGATG 
AACCAAGTTA 
TGGATGCAAC 



ACGCTGTCAA 
AAAAGAAGAT 
CACAAGGCAG 
GGCGGTGCGG 
AAATGATATG 
ACCAACCCGC 
GCGAATGGCG 
GATTGATGGG 
CTTGTAATGG 
TTTGAAAATT 
GAAAAGCGAT 
GAACTAACAA 
TCTGCGCGAT 
GATGCCGCTA 
AAGCGGTCAG 
AATTACCGGT 
TGCCCTCCGT 
CGGCCGTGTA 
CCGTACCCGA 
ATCTGTGGAC 
AAAAATTCAA 
GAAAATGGCG 
GGAAGTGGCG 
GATTCGGCGT 
GGAGCCACAA 
TGCTGCCTAC 
CCATCTACGA 
GCAGCCGATG 
CGTGACTAAC 
CCAAAGTAAA 
GCAGACACTG 
CACCAACGCC 



AACCGGCCGC 
GCGCCACAGG 
CCAAGATATG 
CAACAACGGA 
CCGCAAAATT 
CGATTCTTCA 
GTAGCAATTT 
CCGTCGCAAA 
TGATAATTTA 
TAAATGAGTC 
AAATTTACTA 
ATATGTCATC 
TCAGGCGTTC 
ATCCCCGTCA 
CCTGACGGGG 
ATCTGACTTA 
GTGCAAGGCG 
CAACGGCGAA 
CTAGAGGCAG 
GGCATTATCG 
AGCCGCCATC 
GCGGGGATGT 
GGAAAATACA 
GTTTGCCGGC 
ACGACGACGA 
AACAATGGCC 
CATTGATGAA 
TTGAAGCCGA 
CTGACCAAAA 
AGCTGCAGAA 
ATGCCGCTTT 
TTGAATAAAT 



35 



40 



45 



1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 



TGGGAGAAAA 
AAAATTGATG 
CGAAGCATTC 
CAGACGAAGC 
ACCAAACAAA 
CAAAGCCGAA 
AAGCTGTCGC 
AAAGATAATA 
AGAGTCTGAC 
AAAAATTGGA 
GATACTCGCC 
AACCCGCCAA 
CTTACAACGT 
TCCGAATCGG 
TGCCGCCAAA 
CCTACCATGT 



TATAACGACA 
AAAAAT T AG A 
AACGATATCG 
CGTCAAAACC 
ACGTCGATGC 
GCTGCCGCTG 
TGCAAAAGTT 
TTGCTAAAAA 
AGCAAATTTG 
CACACGCTTG 
TGAACGGTTT 
GGCCTTGCAG 
GGGTCGGTTC 
CAGTCGCCAT 
GCAGGCGTGG 
CGGCGTCAAT 



TTTGCTGAAG 
AGCCGTGGCT 
CCGATTCATT 
GCCAATGAAG 
CAAAGTAAAA 
GCACAGCTAA 
ACCGACATCA 
AGCAAACAGT 
TCAGAATTGA 
GCTTCTGCTG 
G G AT AAAAC A 
AACAAGCCGC 
AATGTAACGG 
CGGTACCGGC 
CAGTCGGCAC 
TACGAGTGGT 



AGACTAAGAC 
GATACCGTCG 
GGATGAAACC 
CCAAACAGAC 
GCTGCAGAAA 
TACTGCAGCC 
AAGCTGATAT 
GCCGACGTGT 
TGGTCTGAAC 
AAAAAT C CAT 
GTGTCAGACC 
GCTCTCCGGT 
CTGCAGTCGG 
TTCCGCTTTA 
TTCGTCCGGT 
AACTCGAG 



AAATATCGTA 
ACAAGCATGC 
AACACTAAGG 
GGCCGAAGAA 
CTGCAGCAGG 
GACAAGGCCG 
CGCTACGAAC 
ACACCAGAGA 
GCTACTACCG 
TGCCGATCAC 
TGCGCAAAGA 
CTGTTCCAAC 
CGGCTACAAA 
CCGAAAACTT 
TCTTCCGCAG 
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15 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 



MASPDVKSAD 
AAVSAENTGN 
DSAPASNPAP 
LDEEAPSKSE 
IYKDKSASSS 
HSGNIFAPEG 
VLHFHTENGR 
DGNGFKGTWT 
KKEQDGSGGG 
DGTITKKDAT 
SEIEKLTTKL 
KIDEKLEAVA 
TKQNVDAKVK 
KDNIAKKANS 
DTRLNGLDKT 
SESAVAIGTG 



TLSKPAAPVV 
GGAATTDKPK 
ANGGSNFGRV 
FENLNESERI 
SARFRRSARS 
NYRYLTYGAE 
PYPTRGRFAA 
ENGGGDVSGR 
GATNDDDVKK 
AADVEADDFK 
ADTDAALADT 
DTVDKHAEAF 
AAETAAGKAE 
ADVYTREESD 
VSDLRKETRQ 
FRFTENFAAK 



AEKETEVKED 
NEDEGPQNDM 
DLANGVLIDG 
EKYKKDGKSD 
RRSLPAEMPL 
KLPGGSYALR 
KVDFGSKSVD 
FYGPAGEEVA 
AAT V A I AAA Y 
GLGLKKVVTN 
DAALDATTNA 
NDIADSLDET 
AAAGTANTAA 
SKFVRIDGLN 
GLAEQAALSG 
AGVAVGTSSG 



APQAGSQGQG 
PQNSAESANQ 
PSQNITLTHC 
KFTNLVATAV 
IPVNQADTLI 
VQGE PAKGEM 
GIIDSGDDLH 
GKYSYRPTDA 
NNGQEINGFK 
LTKTVNENKQ 
LNKLGENITT 
NTKADEAVKT 
DKAEAVAAKV 
ATTEKLDTRL 
LFQPYNVGRF 
SSAAYHVGVN 



APSTQGSQDM 
TGNNQPADSS 
KGDSCNGDNL 
QANGTNKYVI 
VDGEAVSLTG 
LAGTAVYNGE 
MGTQKFKAAI 
EKGGFGVFAG 
AGETIYDIDE 
N V D AKVKAAE 
FAEETKTNIV 
ANEAKQTAEE 
TDIKADIATN 
ASAEKSIADH 
NVTAAVGGYK 
YEW* 



20 





ELISA 


Bactericidal 


AG287-953-His 


3834 


65536 


AG287-961-His 


108627 


65536 



[0219] The bactericidal efficacy (homologous strain) of antibodies raised against the hybrid proteins was compared 
with antibodies raised against simple mixtures of the component antigens (using 287-GST) for 919 and ORF46.1 : 





Mixture with 287 


Hybrid with AG287 


919 


32000 


128000 


ORF46.1 


128 


16000 



[0220] Data for bactericidal activity against heterologous MenB strains and against serotypes A and C were also 
obtained: 





919 


ORF46.1 


Strain 


Mixture 


Hybrid 


Mixture 


Hybrid 


NGH38 


1024 


32000 




16384 


MC58 


512 


8192 




512 


BZ232 


512 


512 






Men A (F6124) 


512 


32000 




8192 


MenC (C11) 


>2048 


>2048 






MenC(BZ133) 


>4096 


64000 




8192 



[0221] The hybrid proteins with AG287 at the N-terminus are therefore immunologically superior to simple mixtures, 
with AG287-ORF46.1 being particularly effective, even against heterologous strains. AG287-ORF46.1 K may be ex- 
pressed in pET-24b. 

[0222] The same hybrid proteins were made using New Zealand strain 394/98 rather than 2996: 
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AG287NZ-919 



10 



15 



20 



25 



30 



35 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 



ATGGCTAGCC 
CCCTGTTGTT 
CAGGTTCTCA 
GCGGCGGTTT 
CAAACCCAAA 
CCGCCGATAC 
CCGGCCGGAA 
GCCGGCAAAC 
ACGATCCGTC 
ACAAATCAAG 
TTCAACCAAT 
ACGTGGGCAA 
ACCCACTGTA 
AGTACAGCTA 
GTAATTACAA 
GGTTTGGTTG 
CTTTTATAAA 
GGTCGAGGCG 
GCGGATACGC 
CGGCAATATC 
CGGAAAAATT 
TCAAAAGGCG 
GCATTTTCAT 
CCGCAAAAGT 
GGCGATGGTT 
AAACGGCTTT 
GAAAGTTTTA 
CGCCCAACAG 
AGAGCAGGAT 
CCTTTCCGCA 
GGCATCCCCG 
TACCGTTGTA 
CCAAAAGCCT 
CAAGGCTGGC 
CTTTCAGGCA 
CAGGCAACGG 
CTGAAGGGCG 
TATTCCCGAC 
GAAAAGCCCT 
GACAATACCG 
CGCGCGCACA 



CCGATGTCAA 
TCTGAAAAAG 
AGGACAGGGC 
CGGAAGAAAA 
AATGAAGACG 
AGATAGTTTG 
ATATGGAAAA 
CAACCGGATA 
GGCAGGCGGG 
CCGAAAACAA 
CCTAGCGCCA 
TTCTGTTGTG 
AAGGCGATTC 
AAATCAGAAT 
GAAAGATGGG 
CCGATAGTGT 
CCTAAACCCA 
GTCGCTTCCG 
TGATTGTCGA 
TTCGCGCCCG 
GCCCGGCGGA 
AAATGCTCGC 
ACGGAAAACG 
CGATTTCGGC 
TGCATATGGG 
AAGGGGACTT 
CGGCCCGGCC 
ATGCGGAAAA 
GGATCCGGAG 
ACCCGACACA 
ACCCCGCCGG 
CCGCACCTGT 
GCAATCCTTC 
AGGATGTGTG 
AAACAGTTTT 
AAGCCTTGCC 
ACGACAGGCG 
GATTTTATCT 
TGTCCGCATC 
GCGGCACACA 
ACGGCAATCA 



GTCGGCGGAC 
AGACAGAGGC 
GCGCCATCCG 
TACAGGCAAT 
AGGGGGCGCA 
ACACCGAATC 
CCAAGCACCG 
TGGCAAATAC 
GAAAATGCCG 
TCAAACCGCC 
CGAATAGCGG 
ATTGACGGGC 
TTGTAGTGGC 
TTGAAAAATT 
AAGAATGACG 
GCAGATGAAG 
CTTCATTTGC 
GCCGAGATGC 
TGGGGAAGCG 
AAGGGAATTA 
TCGTATGCCC 
GGGCACGGCA 
GCCGTCCGTC 
AGCAAATCTG 
TACGCAAAAA 
GGACGGAAAA 
GGCGAGGAAG 
GGGCGGATTC 
GAGGAGGATG 
TCCGTCATCA 
AACGACGGTC 
CCCTGCCCCA 
CGCCTCGGCT 
CGCCCAAGCC 
TTGAACGCTA 
GGTACGGTTA 
GACGGCACAA 
CCGTCCCCCT 
AGGCAGACGG 
TACCGCCGAC 
AAGGCAGGTT 



ACGCTGTCAA 
AAAGGAAGAT 
CACAAGGCGG 
GGCGGTGCGG 
AAATGATATG 
ACACCCCGGC 
GATGCCGGGG 
GGCGGACGGA 
GCAATACGGC 
GGTTCTCAAA 
TGGTGATTTT 
CGTCGCAAAA 
AATAATTTCT 
AAGTGATGCA 
GGAAGAATGA 
GGAATCAATC 
GCGATTTAGG 
CGCTGATTCC 
GTCAGCCTGA 
CCGGTATCTG 
TCCGTGTTCA 
GTGTACAACG 
CCCGTCCAGA 
TGGACGGCAT 
TTCAAAGCCG 
TGGCGGCGGG 
TGGCGGGAAA 
GGCGTGTTTG 
CCAAAGCAAG 
ACGGCCCGGA 
GGCGGCGGCG 
CTGGGCGGCG 
GCGCCAATTT 
TTTCAAACCC 
TTTCACGCCG 
CCGGCTATTA 
GCCCGCTTCC 
GCCTGCCGGT 
GAAAAAACAG 
CTCTCCCGAT 
TGAAGGAAGC 



AACCTGCCGC 
GCGCCACAGG 
TCAAGATATG 
CAGCAACGGA 
CCGCAAAATG 
TTCGAATATG 
AATCGGAGCA 
ATGCAGGGTG 
TGCCCAAGGT 
ATCCTGCCTC 
GGAAGGACGA 
TATAACGTTG 
TGGATGAAGA 
GACAAAATAA 
TAAATTTGTC 
AAT AT AT T AT 
CGTTCTGCAC 
CGTCAATCAG 
CGGGGCATTC 
ACTTACGGGG 
AGGCGAACCT 
GCGAAGTGCT 
GGCAGGTTTG 
TATCGACAGC 
C CATC GAT GG 
GATGTTTCCG 
ATACAGCTAT 
CCGGCAAAAA 
AG CAT C C AAA 
CCGGCCGGTC 
GGGCCGTCTA 
CAGGATTTCG 
GAAAAACCGC 
CCGTCCATTC 
TGGCAGGTTG 
CGAGCCGGTG 
CGATTTACGG 
TTGCGGAGCG 
CGGCACAATC 
TCCCCATCAC 
CGCTTCCTCC 



2051 CCTACCACAC GCGCAACCAA ATCAACGGCG GCGCGCTTGA CGGCAAAGCC 

2101 CCGATACTCG GTTACGCCGA AGACCCCGTC GAACTTTTTT TTATGCACAT 

2151 CCAAGGCTCG GGCCGTCTGA AAACCCCGTC CGGCAAATAC ATCCGCATCG 

2201 GCTATGCCGA CAAAAACGAA CATCCCTACG TTTCCATCGG ACGCTATATG 

2251 GCGGACAAAG GCTACCTCAA GCTCGGGCAG ACCTCGATGC AGGGCATCAA 

2301 AGCCTATATG CGGCAAAATC CGCAACGCCT CGCCGAAGTT TTGGGTCAAA 

2351 ACCCCAGCTA TATCTTTTTC CGCGAGCTTG CCGGAAGCAG CAATGACGGT 

24 01 CCCGTCGGCG CACTGGGCAC GCCGTTGATG GGGGAATATG CCGGCGCAGT 

24 51 CGACCGGCAC TACATTACCT TGGGCGCGCC CTTATTTGTC GCCACCGCCC 

2501 ATCCGGTTAC CCGCAAAGCC CTCAACCGCC TGATTATGGC GCAGGATACC 

2551 GGCAGCGCGA TTAAAGGCGC GGTGCGCGTG GATTATTTTT GGGGATACGG 

2601 CGACGAAGCC GGCGAACTTG CCGGCAAACA GAAAACCACG GGTTACGTCT 

2 651 GGCAGCTCCT ACCCAACGGT ATGAAGCCCG AATACCGCCC GTAAAAGCTT 
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1 MASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM 

51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM 

101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG 

151 TNQAENNQTA GSQN PASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL 

201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV 

251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ 

301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 

351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 

4 01 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 

10 451 RPTDAEKGGF GVFAGKKEQD GSGGGGCQSK SIQTFPQPDT SVINGPDRPV 

501 GIPDPAGTTV GGGGAVYTW PHLSLPHWAA QDFAKSLQSF RLGCANLKNR 

551 QGWQDVCAQA FQTPVHSFQA KQFFERYFTP WQVAGNGSLA GTVTGYYEPV 

601 LKGDDRRTAQ ARFPIYGIPD DFISVPLPAG LRSGKALVRI RQTGKNSGTI 

651 DNTGGTHTAD LSRFPITART TAIKGRFEGS RFLPYHTRNQ INGGALDGKA 

701 PILGYAEDPV ELFFMHIQGS GRLKTPSGKY IRIGYADKNE HPYVSIGRYM 

15 7 51 ADKGYLKLGQ TSMQGIKAYM RQNPQRLAEV LGQNPSYIFF RELAGSSNDG 

801 PVGALGTPLM GEYAGAVDRH YITLGAPLFV ATAHPVTRKA LNRLIMAQDT 

851 GSAIKGAVRV DYFWGYGDEA GELAGKQKTT GYVWQLLPNG MKPEYRP* 



20 



AG287NZ-953 



25 



1 ATGGCTAGCC CCGATGTCAA GTCGGCGGAC ACGCTGTCAA AACCTGCCGC 

51 CCCTGTTGTT TCTGAAAAAG AG AC AG AG G C AAAGGAAGAT GCGCCACAGG 

101 CAGGTTCTCA AGGACAGGGC GCGCCATCCG CACAAGGCGG TCAAGATATG 

151 GCGGCGGTTT CGGAAGAAAA TACAGGCAAT GGCGGTGCGG CAGCAACGGA 

201 C AAACC C AAA AATGAAGACG AGGGGGCGCA AAATGATATG CCGCAAAATG 

251 CCGCCGATAC AGATAGTTTG ACACCGAATC ACACCCCGGC TTCGAATATG 

301 CCGGCCGGAA ATATGGAAAA CCAAGCACCG GATGCCGGGG AATCGGAGCA 

351 GCCGGCAAAC CAACCGGATA TGGCAAATAC GGCGGACGGA ATGCAGGGTG 

4 01 ACGATCCGTC GGCAGGCGGG GAAAATGCCG GCAATACGGC TGCCCAAGGT 

3Q 4 51 ACAAATCAAG CCGAAAACAA TCAAACCGCC GGTTCTCAAA ATCCTGCCTC 

501 TTCAACCAAT CCTAGCGCCA CGAATAGCGG TGGTGATTTT GGAAGGACGA 

551 ACGTGGGCAA TTCTGTTGTG ATTGACGGGC CGTCGCAAAA TATAACGTTG 

601 ACCCACTGTA AAGGCGATTC TTGTAGTGGC AATAATTTCT TGGATGAAGA 

651 AGTACAGCTA AAATCAGAAT TTGAAAAATT AAGTGATGCA GACAAAATAA 

7 01 GTAATTACAA GAAAGATGGG AAGAATGACG GGAAGAATGA TAAATTTGTC 

35 7 51 GGTTTGGTTG CCGATAGTGT GCAGATGAAG GGAATCAATC AATATATTAT 

801 CTTTTATAAA CCTAAACCCA CTTCATTTGC GCGATTTAGG CGTTCTGCAC 

851 GGTCGAGGCG GTCGCTTCCG GCCGAGATGC CGCTGATTCC CGTCAATCAG 

901 GCGGATACGC TGATTGTCGA TGGGGAAGCG GTCAGCCTGA CGGGGCATTC 

951 CGG CAATATC TTCGCGCCCG AAGGGAATTA CCGGTATCTG ACTTACGGGG 

1001 CGGAAAAATT GCCCGGCGGA TCGTATGCCC TCCGTGTTCA AGGCGAACCT 

40 1051 TCAAAAGGCG AAATGCTCGC GGGCACGGCA GTGTACAACG GCGAAGTGCT 

1101 GCATTTTCAT ACGGAAAACG GCCGTCCGTC CCCGTCCAGA GGCAGGTTTG 

1151 CCGCAAAAGT CGATTTCGGC AGCAAATCTG TGGACGGCAT TATCGACAGC 

1201 GGCGATGGTT TGCATATGGG TACGCAAAAA TTCAAAGCCG CCATCGATGG 

1251 AAACGGCTTT AAGGGGACTT GGACGGAAAA TGGCGGCGGG GATGTTTCCG 

1301 GAAAGTTTTA CGGCCCGGCC GGCGAGGAAG TGGCGGGAAA ATACAGCTAT 

1351 CGCCCAACAG ATGCGGAAAA GGGCGGATTC GGCGTGTTTG CCGGCAAAAA 

14 01 AGAGCAGGAT GGATCCGGAG GAGGAGGAGC CACCTACAAA GTGGACGAAT 

14 51 ATCACGCCAA CGCCCGTTTC GCCATCGACC ATTTCAACAC CAGCACCAAC 

1501 GTCGGCGGTT TTTACGGTCT GACCGGTTCC GTCGAGTTCG ACCAAGCAAA 

1551 ACGCGACGGT AAAAT CGAC A TCACCATCCC CGTTGCCAAC CTGCAAAGCG 
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GTTCGCAACA CTTTACCGAC CACCTGAAAT CAGCCGACAT CTTCGATGCC 

GCCCAATATC CGGACATCCG CTTTGTTTCC ACCAAATTCA ACTTCAACGG 

CAAAAAACTG GTTTCCGTTG ACGGCAACCT GACCATGCAC GGCAAAACCG 

CCCCCGTCAA ACTCAAAGCC GAAAAATTCA ACTGCTACCA AAGCCCGATG 

GCGAAAACCG AAGTTTGCGG CGGCGACTTC AGCACCACCA TCGACCGCAC 

CAAATGGGGC GTGGACTACC TCGTTAACGT TGGTATGACC AAAAGCGTCC 

GCATCGACAT CCAAATCGAG GCAGCCAAAC AATAAAAGCT T 



MASPDVKSAD 
AAVSEENTGN 
PAGNMENQAP 
TNQAENNQTA 
THCKGDSCSG 
GLVADSVQMK 
ADTLIVDGEA 
SKGEMLAGTA 
GDGLHMGTQK 
RPTDAEKGGF 
VGGFYGLTGS 
AQYPDIRFVS 
AKTEVCGGDF 



TLSKPAAPVV 
GGAAATDKPK 
DAGESEQPAN 
GSQN PASSTN 
NNFLDEEVQL 
GINQYIIFYK 
VSLTGHSGNI 
VYNGEVLHFH 
FKAAIDGNGF 
GVFAGKKEQD 
VEFDQAKRDG 
TKFNFNGKKL 
STTIDRTKWG 



SEKETEAKED 
NEDEGAQNDM 
QPDMANTADG 
PSATNSGGDF 
KSEFEKLSDA 
PKPTSFARFR 
FAPEGNYRYL 
TENGRPSPSR 
KGTWTENGGG 
GSGGGGATYK 
KIDITIPVAN 
VSVDGNLTMH 
VDYLVNVGMT 



APQAGSQGQG 
PQNAADTDSL 
MQGDDPSAGG 
GRTNVGNSVV 
DKISNYKKDG 
RSARSRRSLP 
TYGAEKLPGG 
GRFAAKVDFG 
DVSGKFYGPA 
VDEYHANARF 
LQSGSQHFTD 
GKTAPVKLKA 
KSVRIDIQIE 



APSAQGGQDM 
TPNHTPASNM 
ENAGNTAAQG 
IDGPSQNITL 
KNDGKNDKFV 
AEMPLIPVNQ 
SYALRVQGEP 
SKSVDGIIDS 
GEEVAGKYSY 
AIDHFNTSTN 
HLKSADIFDA 
EKFNCYQSPM 
AAKQ* 
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AG287NZ-961 



10 



15 



20 



25 



30 



35 



40 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 



ATGGCTAGCC 
CCCTGTTGTT 
CAGGTTCTCA 
GCGGCGGTTT 
CAAACCCAAA 
CCGCCGATAC 
CCGGCCGGAA 
GCCGGCAAAC 
ACGATCCGTC 
ACAAATCAAG 
TTCAACCAAT 
ACGTGGGCAA 
ACCCACTGTA 
AGTACAGCTA 
GTAATTACAA 
GGTTTGGTTG 
CTTTTATAAA 
GGTCGAGGCG 
GCGGATACGC 
CGGCAATATC 
CGGAAAAATT 
TCAAAAGGCG 
GCATTTTCAT 
CCGCAAAAGT 
GGCGATGGTT 
AAACGGCTTT 
GAAAGTTTTA 
CGCCCAACAG 
AGAGCAGGAT 
AAAAAGCTGC 
ATCAACGGTT 
CACAATTACC 
TTAAAGGTCT 
AATGAAAACA 
AAT AG AAAAG 
ATACTGATGC 
GAAAATATAA 
TGATGAAAAA 
CATTCAACGA 
GAAGCCGTCA 
ACAAAACGTC 
CCGAAGCTGC 
GTCGCTGCAA 



CCGATGTCAA 
TCTGAAAAAG 
AGGACAGGGC 
CGGAAGAAAA 
AATGAAGACG 
AGATAGTTTG 
AT AT G G AAAA 
CAACCGGATA 
GGCAGGCGGG 
CCGAAAACAA 
CCTAGCGCCA 
TTCTGTTGTG 
AAGGCGATTC 
AAATCAGAAT 
GAAAGATGGG 
CCGATAGTGT 
CCTAAACCCA 
GTCGCTTCCG 
TGATTGTCGA 
TTCGCGCCCG 
GCCCGGCGGA 
AAATGCTCGC 
ACGGAAAACG 
CGATTTCGGC 
TGCATATGGG 
AAGGGGACTT 
CGGCCCGGCC 
ATGCGGAAAA 
GGATCCGGAG 
CACTGTGGCC 
TCAAAGCTGG 
AAAAAAGACG 
GGGTCTGAAA 
AACAAAACGT 
TTAACAACCA 
CGCTCTGGAT 
CGACATTTGC 
TTAGAAGCCG 
TATCGCCGAT 
AAACCGCCAA 
GATGCCAAAG 
CGCTGGCACA 
AAGTTACCGA 



GTCGGCGGAC 
AGACAGAGGC 
GCGCCATCCG 
TACAGGCAAT 
AGGGGGCGCA 
ACACCGAATC 
CCAAGCACCG 
TGGCAAATAC 
GAAAATGCCG 
TCAAACCGCC 
CGAATAGCGG 
ATTGACGGGC 
TTGTAGTGGC 
TTGAAAAATT 
AAGAATGACG 
GCAGATGAAG 
CTTCATTTGC 
GCCGAGATGC 
TGGGGAAGCG 
AAGGGAATTA 
TCGTATGCCC 
GGGCACGGCA 
GCCGTCCGTC 
AGCAAATCTG 
TACGCAAAAA 
GGACGGAAAA 
GGCGAGGAAG 
GGGCGGATTC 
GAG GAG GAG C 
ATTGCTGCTG 
AGAGACCATC 
CAACTGCAGC 
AAAGTCGTGA 
CGATGCCAAA 
AGTTAGCAGA 
GCAACCACCA 
TGAAGAGACT 
TGGCTGATAC 
TCATTGGATG 
TGAAGCCAAA 
TAAAAGCTGC 
GCTAATACTG 
CATCAAAGCT 



ACGCTGTCAA 
AAAGGAAGAT 
CACAAGGCGG 
GGCGGTGCGG 
AAATGATATG 
ACACCCCGGC 
GATGCCGGGG 
GGCGGACGGA 
GCAATACGGC 
GGTTCTCAAA 
TGGTGATTTT 
CGTCGCAAAA 
AATAATTTCT 
AAGTGATGCA 
GGAAGAATGA 
GGAATCAATC 
GCGATTTAGG 
CGCTGATTCC 
GTCAGCCTGA 
CCGGTATCTG 
TCCGTGTTCA 
GTGTACAACG 
CCCGTCCAGA 
TGGACGGCAT 
TTCAAAGCCG 
TGGCGGCGGG 
TGGCGGGAAA 
GGCGTGTTTG 
CACAAACGAC 
CCTACAACAA 
TACGACATTG 
CGATGTTGAA 
CTAACCTGAC 
GTAAAAGCTG 
CACTGATGCC 
ACGCCTTGAA 
AAGACAAATA 
CGTCGACAAG 
AAACCAACAC 
CAGACGGCCG 
AGAAACTGCA 
CAGCCGACAA 
GATATCGCTA 



AACCTGCCGC 
GCGCCACAGG 
TCAAGATATG 
CAGCAACGGA 
CCGCAAAATG 
T T CG AAT AT G 
AATCGGAGCA 
ATGCAGGGTG 
TGCCCAAGGT 
ATCCTGCCTC 
GGAAGGACGA 
TATAACGTTG 
TGGATGAAGA 
GACAAAATAA 
TAAATTTGTC 
AATATATTAT 
CGTTCTGCAC 
CGTCAATCAG 
CGGGGCATTC 
ACTTACGGGG 
AGGCGAACCT 
GCGAAGTGCT 
GGCAGGTTTG 
TATCGACAGC 
C CATC GAT GG 
GATGTTTCCG 
ATACAGCTAT 
CCGGCAAAAA 
GACGATGTTA 
TGGCCAAGAA 
ATGAAGACGG 
GCCGACGACT 
CAAAACCGTC 
CAGAATCTGA 
GCTTTAGCAG 
TAAATTGGGA 
TCGTAAAAAT 
CATGCCGAAG 
TAAGGCAGAC 
AAGAAACCAA 
GCAGGCAAAG 
GGCCGAAGCT 
CGAACAAAGA 



45 



50 



2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 



TAATATTGCT 
CTGACAGCAA 
TTGGACACAC 
TCGCCTGAAC 
GCCAAGGCCT 
AACGTGGGTC 
ATCGGCAGTC 
CCAAAGCAGG 
CATGTCGGCG 



AAAAAAGCAA 
ATTTGTCAGA 
GCTTGGCTTC 
GGTTTGGATA 
TGCAGAACAA 
GGTTCAATGT 
GCCATCGGTA 
CGTGGCAGTC 
TCAATTACGA 



ACAGTGCCGA 
ATTGATGGTC 
TGCTGAAAAA 
AAACAGTGTC 
GCCGCGCTCT 
AACGGCTGCA 
CCGGCTTCCG 
GGCACTTCGT 
GTGGTAAAAG 



CGTGTACACC 
TGAACGCTAC 
TCCATTGCCG 
AGACCTGCGC 
CCGGTCTGTT 
GTCGGCGGCT 
CTTTACCGAA 
CCGGTTCTTC 
CTT 



AGAGAAGAGT 
TACCGAAAAA 
ATCACGATAC 
AAAGAAACCC 
CCAACCTTAC 
ACAAATCCGA 
AACTTTGCCG 
CGCAGCCTAC 
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1 MASPDVKSAD TLSKPAAPVV SEKETEAKED APQAGSQGQG APSAQGGQDM 

51 AAVSEENTGN GGAAATDKPK NEDEGAQNDM PQNAADTDSL TPNHTPASNM 

101 PAGNMENQAP DAGESEQPAN QPDMANTADG MQGDDPSAGG ENAGNTAAQG 

151 TNQAENNQTA GSQNPASSTN PSATNSGGDF GRTNVGNSVV IDGPSQNITL 

201 THCKGDSCSG NNFLDEEVQL KSEFEKLSDA DKISNYKKDG KNDGKNDKFV 

251 GLVADSVQMK GINQYIIFYK PKPTSFARFR RSARSRRSLP AEMPLIPVNQ 

301 ADTLIVDGEA VSLTGHSGNI FAPEGNYRYL TYGAEKLPGG SYALRVQGEP 

351 SKGEMLAGTA VYNGEVLHFH TENGRPSPSR GRFAAKVDFG SKSVDGIIDS 

4 01 GDGLHMGTQK FKAAIDGNGF KGTWTENGGG DVSGKFYGPA GEEVAGKYSY 

4 51 RPTDAEKGGF GVFAGKKEQD GSGGGGATND D D VKKAAT V A IAAAYNNGQE 

501 INGFKAGETI YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV 

551 NENKQNVDAK VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG 

601 ENITTFAEET KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD 

651 EAVKTANEAK QTAEETKQNV D AKVKAAE T A AGKAEAAAGT ANTAADKAEA 

701 VAAKVTDIKA DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK 

751 LDTRLASAEK SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY 

801 NVGRFNVTAA VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY 

851 HVGVNYEW* 



AG983 and hybrids 

[0223] Bactericidal titres generated in response to AG983 (His-fusion) were measured against variousstrains, including 
the homologous 2996 strain: 





2996 


NGH38 


BZ133 


AG983 


512 


128 


128 



[0224] AG983 was also expressed as a hybrid, with ORF46.1 , 741 , 961 or 961c at its C-terminus: 



AG983-ORF4 6.1 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 



ATGACTTCTG 
CAGCAGAGCA 
AGAACGAAAT 
GTTGCGGTTA 
GCATACCGGA 
ACCTCAAACC 
GGTATCGTCG 
GTATGGCAGA 
CGTATATGCG 
GCTTCTTTCG 
TATCCGCCAC 
TTGGCGGGCG 
GCGACGCTAC 
GGTTGCAGCC 
GCATCGTCAA 
CTTTTCCAAA 
CTATTCCGGC 
GCGATTACGG 
ATCTTTTCGA 
ATTGCCATTT 
GCGTAGACCG 
GGTACAGAAC 
GTGGTGCCTG 



CGCCCGACTT 
AC AAC AG C G A 
GTGCAAAGAC 
C AG AC AG G G A 
GACTTTCCAA 
TGCAATTGAA 
ACACAGGCGA 
AAAGAACACG 
GAAGGAAGCG 
ACGATGAGGC 
GTAAAAGAAA 
TTCCGTGGAC 
ACATAATGAA 
ATCCGCAATG 
TAACAGTTTT 
TAGCCAATTC 
GGTGATAAAA 
CAACCTGTCC 
CAGGCAATGA 
TAT GAAAAAG 
CAGTGGAGAA 
CGCTTGAGTA 
TCGGCACCCT 



CAATGCAGGC 
AATCAGCAGC 
AGAAGCATGC 
TGCCAAAATC 
ACCCAAATGA 
GCAGGCTATA 
ATCCGTCGGC 
GCTATAACGA 
CCTGAAGACG 
CGTTATAGAG 
TCGGACACAT 
GGCAGACCTG 
TACGAATGAT 
CATGGGTCAA 
GGAACAACAT 
GGAGGAGCAG 
CAGACGAGGG 
TACCACATCC 
CGCACAAGCT 
ACGCTCAAAA 
AAGTTCAAAC 
TGGCTCCAAC 
ATGAAGCAAG 



GGTACCGGTA 
AGTATCTTAC 
TCTGTGCCGG 
AATGCCCCCC 
CG CAT AC AAG 
CAGGACGCGG 
AGCATATCCT 
AAAT T AC AAA 
GAGGCGGTAA 
ACTGAAGCAA 
CGATTTGGTC 
CAGGCGGTAT 
GAAACCAAGA 
GCTGGGCGAA 
CGAGGGCAGG 
TACCGCCAAG 
TATCCGCCTG 
GTAATAAAAA 
CAGCCCAACA 
AGGCATTATC 
GGGAAATGTA 
CATTGCGGAA 
CGTCCGTTTC 



TCGGCAGCAA 
GCCGGTATCA 
TCGGGATGAC 
CCCCGAATCT 
AATTTGATCA 
GGTAGAGGTA 
TTCCCGAACT 
AACTATACGG 
AGACATTGAA 
AGCCGACGGA 
TCCCATATTA 
TGCGCCCGAT 
ACGAAATGAT 
CGTGGCGTGC 
CACTGCCGAC 
CGTTGCTCGA 
ATGCAACAGA 
CATGCTTTTC 
CATATGCCCT 
ACAGTCGCAG 
TGGAGAACCG 
TTACTGCCAT 
ACCCGTACAA 
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25 



1151 ACCCGATTCA AATTGCCGGA ACATCCTTTT CCGCACCCAT CGTAACCGGC 

1201 ACGGCGGCTC TGCTGCTGCA GAAATACCCG TGGATGAGCA ACGACAACCT 

1251 GCGTACCACG TTGCTGACGA CGGCTCAGGA CATCGGTGCA GTCGGCGTGG 

1301 ACAGCAAGTT CGGCTGGGGA CTGCTGGATG CGGGTAAGGC CATGAACGGA 

5 1351 CCCGCGTCCT TTCCGTTCGG CGACTTTACC GCCGATACGA AAGGTACATC 

1401' CGATATTGCC TACTCCTTCC GTAACGACAT TTCAGGCACG GGCGGCCTGA 

1451 TCAAAAAAGG CGGCAGCCAA CTGCAACTGC ACGGCAACAA CACCTATACG 

1501 GGCAAAACCA TTATCGAAGG CGGTTCGCTG GTGTTGTACG GCAACAACAA 

1551 ATCGGATATG CGCGTCGAAA CCAAAGGTGC GCTGATTTAT AACGGGGCGG 

1601 CATCCGGCGG CAGCCTGAAC AGCGACGGCA TTGTCTATCT GGCAGATACC 

10 1651 GACCAATCCG GCGCAAACGA AACCGTACAC ATCAAAGGCA GTCTGCAGCT 

1701 GGACGGCAAA GGTACGCTGT ACACACGTTT GGGCAAACTG CTGAAAGTGG 

1751 ACGGTACGGC GATTATCGGC GGCAAGCTGT ACATGTCGGC ACGCGGCAAG 

1801 GGGGCAGGCT ATCTCAACAG TACCGGACGA CGTGTTCCCT TCCTGAGTGC 

1851 CGCCAAAATC GGGCAGGATT ATTCTTTCTT CACAAACATC GAAACCGACG 

1901 GCGGCCTGCT GGCTTCCCTC GACAGCGTCG AAAAAACAGC GGGCAGTGAA 

75 1951 GGCGACACGC TGTCCTATTA TGTCCGTCGC GGCAATGCGG CACGGACTGC 

2001 TTCGGCAGCG GCACATTCCG CGCCCGCCGG TCTGAAACAC GCCGTAGAAC 

2051 AGGGCGGCAG CAATCTGGAA AACCTGATGG TCGAACTGGA TGCCTCCGAA 

2101 TCATCCGCAA CACCCGAGAC GGTTGAAACT GCGGCAGCCG ACCGCACAGA 

2151 TATGCCGGGC ATCCGCCCCT ACGGCGCAAC TTTCCGCGCA GCGGCAGCCG 

2201 TACAGCATGC GAATGCCGCC GACGGTGTAC GCATCTTCAA CAGTCTCGCC 

2Q 2251 GCTACCGTCT ATGCCGACAG TACCGCCGCC CATGCCGATA TGCAGGGACG 

2301 CCGCCTGAAA GCCGTATCGG ACGGGTTGGA CCACAACGGC ACGGGTCTGC 

2351 GCGTCATCGC GCAAACCCAA CAGGACGGTG GAACGTGGGA ACAGGGCGGT 

24 01 GTTGAAGGCA AAATGCGCGG CAGTACCCAA ACCGTCGGCA TTGCCGCGAA 

24 51 AACCGGCGAA AATACGACAG CAGCCGCCAC ACTGGGCATG GGACGCAGCA 

2501 CATGGAGCGA AAACAGTGCA AATGCAAAAA CCGACAGCAT TAGTCTGTTT 

2551 GCAGGCATAC GGCACGATGC GGGCGATATC GGCTATCTCA AAGGCCTGTT 

2601 CTCCTACGGA CGCTACAAAA ACAGCATCAG CCGCAGCACC GGTGCGGACG 

2 651 AACATGCGGA AGGCAGCGTC AACGGCACGC TGATGCAGCT GGGCGCACTG 

2701 GGCGGTGTCA ACGTTCCGTT TGCCGCAACG GGAGATTTGA CGGTCGAAGG 

27 51 CGGTCTGCGC TACGACCTGC TCAAACAGGA TGCATTCGCC GAAAAAGGCA 

2801 GTGCTTTGGG CTGGAGCGGC AACAGCCTCA CTGAAGGCAC GCTGGTCGGA 

2851 CTCGCGGGTC TGAAGCTGTC GCAACCCTTG AGCGATAAAG CCGTCCTGTT 

30 2901 TGCAACGGCG GGCGTGGAAC GCGACCTGAA CGGACGCGAC TACACGGTAA 

2951 CGGGCGGCTT TACCGGCGCG AC TG C AG C AA CCGGCAAGAC GGGGGCACGC 

3001 AATATGCCGC ACACCCGTCT GGTTGCCGGC CTGGGCGCGG ATGTCGAATT 

3051 CGGCAACGGC TGGAACGGCT TGGCACGTTA CAGCTACGCC GGTTCCAAAC 

3101 AGTACGGCAA CCACAGCGGA CGAGTCGGCG TAGGCTACCG GTTCCTCGAC 

3151 GGTGGCGGAG GCACTGGATC CTCAGATTTG GCAAACGATT CTTTTATCCG 

35 3201 GCAGGTTCTC GACCGTCAGC ATTTCGAACC CGACGGGAAA TACCACCTAT 

3251 TCGGCAGCAG GGGGGAACTT GCCGAGCGCA GCGGCCATAT CGGATTGGGA 

3301 AAAATACAAA GCCATCAGTT GGGCAACCTG ATGATTCAAC AGGCGGCCAT 

3351 TAAAGGAAAT ATCGGCTACA TTGTCCGCTT TTCCGATCAC GGGCACGAAG 

34 01 TCCATTCCCC CTTCGACAAC CATGCCTCAC ATTCCGATTC TGATGAAGCC 

34 51 GGTAGTCCCG TTGACGGATT TAGCCTTTAC CGCATCCATT GGGACGGATA 

40 3501 CGAACACCAT CCCGCCGACG GCTATGACGG GCCACAGGGC GGCGGCTATC 

3551 CCGCTCCCAA AGGCGCGAGG GATATATACA GCTACGACAT AAAAGGCGTT 

3601 GCCCAAAATA TCCGCCTCAA CCTGACCGAC AACCGCAGCA CCGGACAACG 

3651 GCTTGCCGAC CGTTTCCACA ATGCCGGTAG TATGCTGACG CAAGGAGTAG 

3701 GCGACGGATT CAAACGCGCC ACCCGATACA GCCCCGAGCT GGACAGATCG 

37 51 GGCAATGCCG CCGAAGCCTT CAACGGCACT G C AG AT AT CG TTAAAAACAT 

45 3801 CATCGGCGCG G C AG GAG AAA TTGTCGGCGC AGGCGATGCC GTGCAGGGCA 

3851 TAAGCGAAGG CTCAAACATT GCTGTCATGC ACGGCTTGGG TCTGCTTTCC 

3901 ACCGAAAACA AGATGGCGCG CATCAACGAT TTGGCAGATA TGGCGCAACT 

3951 CAAAGACTAT GCCGCAGCAG CCATCCGCGA TTGGGCAGTC CAAAACCCCA 

4 001 ATGCCGCACA AGGCATAGAA GCCGTCAGCA ATATCTTTAT GGCAGCCATC 

4 051 CCCATCAAAG GGATTGGAGC TGTTCGGGGA AAATACGGCT TGGGCGGCAT 

50 4101 CACGGCACAT CCTATCAAGC GGTCGCAGAT GGGCGCGATC GCATTGCCGA 

4151 AAGGGAAATC CGCCGTCAGC GACAATTTTG CCGATGCGGC ATACGCCAAA 

4201 TACCCGTCCC CTTACCATTC CCGAAATATC CGTTCAAACT TGGAGCAGCG 

4 251 TTACGGCAAA GAAAACAT C A CCTCCTCAAC CGTGCCGCCG TCAAACGGCA 

4 301 AAAATGTCAA ACTGGCAGAC CAACGCCACC CGAAGACAGG CGTACCGTTT 

4 351 GACGGTAAAG GGTTTCCGAA TTTTGAGAAG CACGTGAAAT ATGATACGCT 

55 4 4 01 CGAGCACCAC CACCACCACC ACTGA 
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MTSAPDFNAG 
VAVTDRDAKI 
GIVDTGESVG 
ASFDDEAVIE 
ATLHIMNTND 
LFQIANSEEQ 
IFSTGNDAQA 
GTEPLEYGSN 
TAALLLQKYP 
PASFPFGDFT 
GKTIIEGGSL 
DQSGANETVH 
GAGYLNSTGR 
GDTLSYYVRR 
SSATPETVET 
ATVYADSTAA 
VEGKMRGSTQ 
AGIRHDAGDI 
GGVNVPFAAT 
LAGLKLSQPL 
NMPHTRLVAG 
GGGGTGSSDL 
KIQSHQLGNL 
GSPVDGFSLY 
AQNIRLNLTD 
GNAAEAFNGT 
TENKMARIND 
PIKGIGAVRG 
YPSPYHSRNI 
DGKGFPN FEK 



GTGIGSNSRA 
NAPPPNLHTG 
SISFPELYGR 
TEAKPTDIRH 
ETKNEMMVAA 
YRQALLDYSG 
QPNTYALLPF 
HCGITAMWCL 
WMSNDNLRTT 
ADTKGTSDIA 
VLYGNNKSDM 
IKGSLQLDGK 
RVPFLSAAKI 
GNAARTASAA 
AAADRTDMPG 
HADMQGRRLK 
TVGIAAKTGE 
GYLKGLFSYG 
GDLTVEGGLR 
SDKAVLFATA 
LGADVEFGNG 
ANDSFIRQVL 
MIQQAAIKGN 
RIHWDGYEHH 
NRSTGQRLAD 
ADIVKNI IGA 
LADMAQLKDY 
KYGLGGITAH 
RSNLEQRYGK 
HVKYDTLEHH 



TTAKSAAVSY 
DFPN PNDAYK 
KEHGYNENYK 
VKEIGHIDLV 
IRNAWVKLGE 
GDKTDEGIRL 
YEKDAQKGII 
SAPYEASVRF 
LLTTAQDIGA 
YSFRNDISGT 
RVETKGALIY 
GTLYTRLGKL 
GQDYSFFTNI 
AHSAPAGLKH 
IRPYGAT FRA 
AVSDGLDHNG 
NTTAAATLGM 
RYKNSISRST 
YDLLKQDAFA 
GVERDLNGRD 
WNGLARYSYA 
DRQHFEPDGK 
IGYIVRFSDH 
PADGYDGPQG 
RFHNAGSMLT 
AGE I VG AG DA 
AAAAIRDWAV 
PIKRSQMGAI 
ENITSSTVPP 
HHHH* 



AGIKNEMCKD 
NLINLKPAIE 
NYTAYMRKEA 
SHIIGGRSVD 
RGVRIVNNSF 
MQQSDYGNLS 
TVAGVDRSGE 
TRTNPIQIAG 
VGVDSKFGWG 
GGLIKKGGSQ 
NGAASGGSLN 
LKVDGTAIIG 
ETDGGLLASL 
AVEQGGSNLE 
AAAVQHANAA 
TGLRVIAQTQ 
GRSTWSENSA 
GADEHAEGSV 
EKGSALGWSG 
YTVTGGFTGA 
GSKQYGNHSG 
YHLFGSRGEL 
GHEVHSPFDN 
GGYPAPKGAR 
QGVGDGFKRA 
VQGISEGSNI 
QNPNAAQGIE 
ALPKGKSAVS 
SNGKNVKLAD 



RSMLCAGRDD 
AGYTGRGVEV 
PEDGGGKDIE 
GRPAGGIAPD 
GTTSRAGTAD 
YHIRNKNMLF 
KFKREMYGEP 
TSFSAPIVTG 
LLDAGKAMNG 
LQLHGNNTYT 
SDGIVYLADT 
GKLYMSARGK 
DSVEKTAGSE 
NLMVELDASE 
DGVRI FN SLA 
QDGGTWEQGG 
NAKTDSISLF 
NGTLMQLGAL 
NSLTEGTLVG 
TAATGKTGAR 
RVGVGYRFLD 
AERSGHIGLG 
HASHSDSDEA 
DIYSYDIKGV 
TRYSPELDRS 
AVMHGLGLLS 
AVSNIFMAAI 
DNFADAAYAK 
QRHPKTGVPF 
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741 

ATGACTTCTG 
CAGCAGAGCA 
AGAACGAAAT 
GTTGCGGTTA 
GCATACCGGA 
ACCTCAAACC 
GGTATCGTCG 
GTATGGCAGA 
CGTATATGCG 
GCTTCTTTCG 
TATCCGCCAC 
TTGGCGGGCG 
GCGACGCTAC 
GGTTGCAGCC 
GCATCGTCAA 
CTTTTCCAAA 
CTATTCCGGC 
GCGATTACGG 
ATCTTTTCGA 
ATTGCCATTT 
GCGTAGACCG 
GGTACAGAAC 
GTGGTGCCTG 
ACCCGATTCA 
ACGGCGGCTC 
GCGTACCACG 
ACAGCAAGTT 
CCCGCGTCCT 
CGATATTGCC 
TCAAAAAAGG 
GGCAAAACCA 
ATCGGATATG 
CATCCGGCGG 
GACCAATCCG 



CGCCCGACTT 
ACAACAGCGA 
GTGCAAAGAC 
CAGACAGGGA 
GACTTTCCAA 
TGCAATTGAA 
ACACAGGCGA 
AAAGAACACG 
GAAGGAAGCG 
AC GAT GAG GC 
GTAAAAGAAA 
TTCCGTGGAC 
ACATAATGAA 
ATCCGCAATG 
TAACAGTTTT 
TAGCCAATTC 
GGTGATAAAA 
CAACCTGTCC 
CAGGCAATGA 
TATGAAAAAG 
CAGTGGAGAA 
CGCTTGAGTA 
TCGGCACCCT 
AATTGCCGGA 
TGCTGCTGCA 
TTGCTGACGA 
CGGCTGGGGA 
TTCCGTTCGG 
TACTCCTTCC 
CGGCAGCCAA 
TTATCGAAGG 
CGCGTCGAAA 
CAGCCTGAAC 
GCGCAAACGA 



CAATGCAGGC 
AATCAGCAGC 
AGAAGCATGC 
TGCCAAAATC 
ACCCAAATGA 
GCAGGCTATA 
ATCCGTCGGC 
GCTATAACGA 
CCTGAAGACG 
C GT TAT AG AG 
TCGGACACAT 
GGCAGACCTG 
TACGAATGAT 
CATGGGTCAA 
GGAACAACAT 
GGAGGAGCAG 
CAGACGAGGG 
TACCACATCC 
CGCACAAGCT 
ACGCTCAAAA 
AAGTTCAAAC 
TGGCTCCAAC 
ATGAAGCAAG 
ACATCCTTTT 
GAAATACCCG 
CGGCTCAGGA 
CTGCTGGATG 
CGACTTTACC 
GTAACGACAT 
CTGCAACTGC 
CGGTTCGCTG 
CCAAAGGTGC 
AGCGACGGCA 
AACCGTACAC 



GGTACCGGTA 
AGTATCTTAC 
TCTGTGCCGG 
AATGCCCCCC 
CGCATACAAG 
CAGGACGCGG 
AGCATATCCT 
AAATTACAAA 
GAGGCGGTAA 
ACTGAAGCAA 
CGATTTGGTC 
CAGGCGGTAT 
GAAACCAAGA 
GCTGGGCGAA 
CGAGGGCAGG 
TACCGCCAAG 
TATCCGCCTG 
GTAATAAAAA 
CAGCCCAACA 
AGGCATTATC 
GGGAAATGTA 
CATTGCGGAA 
CGTCCGTTTC 
CCGCACCCAT 
TGGATGAGCA 
CATCGGTGCA 
CGGGTAAGGC 
GCCGATACGA 
TTCAGGCACG 
ACGGCAACAA 
GTGTTGTACG 
GCTGATTTAT 
TTGTCTATCT 
ATCAAAGGCA 



TCGGCAGCAA 
GCCGGTATCA 
TCGGGATGAC 
CCCCGAATCT 
AATTTGATCA 
GGTAGAGGTA 
TTCCCGAACT 
AACTATACGG 
AG AC AT T G AA 
AGCCGACGGA 
TCCCATATTA 
TGCGCCCGAT 
ACGAAATGAT 
CGTGGCGTGC 
CACTGCCGAC 
CGTTGCTCGA 
ATGCAACAGA 
CATGCTTTTC 
CATATGCCCT 
ACAGTCGCAG 
TGGAGAACCG 
TTACTGCCAT 
ACCCGTACAA 
CGTAACCGGC 
ACGACAACCT 
GTCGGCGTGG 
CATGAACGGA 
AAGGTACATC 
GGCGGCCTGA 
CACCTATACG 
GCAACAACAA 
AACGGGGCGG 
GGCAGATACC 
GTCTGCAGCT 
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GGACGGCAAA 
ACGGTACGGC 
GGGGCAGGCT 
CGCCAAAATC 
GCGGCCTGCT 
GGCGACACGC 
TTCGGCAGCG 
AGGGCGGCAG 
TCATCCGCAA 
TATGCCGGGC 
TACAGCATGC 
GCTACCGTCT 
CCGCCTGAAA 
GCGTCATCGC 
GTTGAAGGCA 
AACCGGCGAA 
CATGGAGCGA 
GCAGGCATAC 
CTCCTACGGA 
AACATGCGGA 
GGCGGTGTCA 
CGGTCTGCGC 
GTGCTTTGGG 
CTCGCGGGTC 
TGCAACGGCG 
CGGGCGGCTT 
AATATGCCGC 
CGGCAACGGC 
AGTACGGCAA 
GGATCCGGAG 
TGCACTAACC 
CGCTGGATCA 
GGTGCGGAAA 
GAAGAACGAC 
ACGGGCAGCT 
AGCCATTCCG 
GCATTCCGGG 
CGGGCGAACA 
TATCGCGGGA 
CACCATAGAT 
AATCGCCAGA 
GGAAAACGCC 
GAAAGGCAGT 
CCGGCAGCGC 
GCCGCCAAGC 



GGTACGCTGT 
GATTATCGGC 
ATCTCAACAG 
GGGCAGGATT 
GGCTTCCCTC 
TGTCCTATTA 
GCACATTCCG 
CAATCTGGAA 
CACCCGAGAC 
ATCCGCCCCT 
GAATGCCGCC 
ATGCCGACAG 
GCCGTATCGG 
GCAAACCCAA 
AAATGCGCGG 
AATACGACAG 
AAACAGTGCA 
GGCACGATGC 
CGCTACAAAA 
AGGCAGCGTC 
ACGTTCCGTT 
TACGACCTGC 
CTGGAGCGGC 
TGAAGCTGTC 
GGCGTGGAAC 
TACCGGCGCG 
ACACCCGTCT 
TGGAACGGCT 
C C AC AG CGG A 
GGGGTGGTGT 
GCACCGCTCG 
GTCCGTCAGG 
AAACTTATGG 
AAGGTCAGCC 
CATTACCTTG 
CCTTAACCGC 
AAGATGGTTG 
TACATCTTTT 
CGGCGTTCGG 
TTCGCCGCCA 
ACTCAATGTC 
ATGCCGTCAT 
TACTCCCTCG 
G G AAGT G AAA 
AACTCGAGCA 



ACACACGTTT 
GGCAAGCTGT 
TACCGGACGA 
ATTCTTTCTT 
GACAGCGTCG 
TGTCCGTCGC 
CGCCCGCCGG 
AACCTGATGG 
GGTTGAAACT 
ACGGCGCAAC 
GACGGTGTAC 
TACCGCCGCC 
ACGGGTTGGA 
CAGGACGGTG 
CAGTACCCAA 
CAGCCGCCAC 
AATGCAAAAA 
GGGCGATATC 
AC AG CATC AG 
AACGGCACGC 
TGCCGCAACG 
TCAAACAGGA 
AACAGCCTCA 
GCAACCCTTG 
GCGACCTGAA 
ACTGCAGCAA 
GGTTGCCGGC 
TGGCACGTTA 
CGAGTCGGCG 
CGCCGCCGAC 
ACCATAAAGA 
AAAAACGAGA 
AAACGGTGAC 
GTTTCGACTT 
GAGAGTGGAG 
CTTTCAGACC 
CGAAACGCCA 
GACAAGCTTC 
TTCAGACGAT 
AG C AG G G AAA 
GACCTGGCCG 
CAGCGGTTCC 
GTATCTTTGG 
ACCGTAAACG 
CCACCACCAC 



GGGCAAACTG 
ACATGTCGGC 
CGTGTTCCCT 
CACAAACATC 
AAAAAACAGC 
GGCAATGCGG 
TCTGAAACAC 
TCGAACTGGA 
GCGGCAGCCG 
TTTCCGCGCA 
GCATCTTCAA 
CATGCCGATA 
CCACAACGGC 
GAACGTGGGA 
ACCGTCGGCA 
ACTGGGCATG 
CCGACAGCAT 
GGCTATCTCA 
CCGCAGCACC 
TGATGCAGCT 
GGAGATTTGA 
TGCATTCGCC 
CTGAAGGCAC 
AGCGATAAAG 
CGGACGCGAC 
CCGGCAAGAC 
CTGGGCGCGG 
CAGCTACGCC 
TAGGCTACCG 
ATCGGTGCGG 
CAAAGGTTTG 
AACTGAAGCT 
AGCCTCAATA 
TATCCGCCAA 
AGTTCCAAGT 
GAGCAAATAC 
GTTCAGAATC 
CCGAAGGCGG 
GCCGGCGGAA 
CGGCAAAATC 
CCGCCG AT AT 
GTCCTTTACA 
CGGAAAAGCC 
GCATACGCCA 
CACCACTGA 



CTGAAAGTGG 
ACGCGGCAAG 
TCCTGAGTGC 
GAAACCGACG 
GGGCAGTGAA 
CACGGACTGC 
GCCGTAGAAC 
TGCCTCCGAA 
ACCGCACAGA 
GCGGCAGCCG 
CAGTCTCGCC 
TGCAGGGACG 
ACGGGTCTGC 
ACAGGGCGGT 
TTGCCGCGAA 
GGACGCAGCA 
TAGTCTGTTT 
AAGGCCTGTT 
GGTGCGGACG 
GGGCGCACTG 
CGGTCGAAGG 
G AAAAAGG C A 
GCTGGTCGGA 
CCGTCCTGTT 
TACACGGTAA 
GGGGGCACGC 
ATGTCGAATT 
GGTTCCAAAC 
GTTCCTCGAG 
GGCTTGCCGA 
CAGTCTTTGA 
GGCGGCACAA 
CGGGCAAATT 
ATCGAAGTGG 
ATACAAACAA 
AAGATTCGGA 
GGCGACATAG 
CAGGGCGACA 
AACTGACCTA 
GAACATTTGA 
CAAGCCGGAT 
ACCAAGCCGA 
CAGGAAGTTG 
TATCGGCCTT 
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MTSAPDFNAG 
VAVTDRDAKI 
GIVDTGESVG 
ASFDDEAVIE 
ATLHIMNTND 
LFQIANSEEQ 
IFSTGNDAQA 
GTEPLEYGSN 
TAALLLQKYP 
PASFPFGDFT 
GKTIIEGGSL 
DQSGANETVH 
GAGYLNSTGR 
GDTLSYYVRR 
S SAT PET VET 
ATVYADSTAA 
VEGKMRGSTQ 
AGIRHDAGDI 
GGVNVPFAAT 
LAGLKLSQPL 
NMPHTRLVAG 



GTGIGSNSRA 
NAPPPNLHTG 
SISFPELYGR 
TEAKPTDIRH 
ETKNEMMVAA 
YRQALLDYSG 
QPNTYALLPF 
HCGITAMWCL 
WMSNDNLRTT 
ADTKGTSDIA 
VLYGNNKSDM 
IKGSLQLDGK 
RVPFLSAAKI 
GNAARTASAA 
AAADRTDMPG 
HADMQGRRLK 
TVGIAAKTGE 
GYLKGLFSYG 
GDLTVEGGLR 
SDKAVLFATA 
LGADVE FGNG 



TTAKSAAVSY 
DFPN PNDAYK 
KEHGYNENYK 
VKEIGHIDLV 
IRNAWVKLGE 
GDKTDEGIRL 
YEKDAQKGII 
SAPYEASVRF 
LLTTAQDIGA 
YSFRNDISGT 
RVETKGALIY 
GTLYTRLGKL 
GQDYSFFTNI 
AHSAPAGLKH 
IRPYGATFRA 
AVSDGLDHNG 
NTTAAATLGM 
RYKNSISRST 
YDLLKQDAFA 
GVERDLNGRD 
WNGLARYSYA 



AGIKNEMCKD 
NLINLKPAIE 
NYTAYMRKEA 
SHIIGGRSVD 
RGVRI VNNSF 
MQQSDYGNLS 
TVAGVDRSGE 
TRTNPIQIAG 
VGVDSKFGWG 
GGLIKKGGSQ 
NGAASGGSLN 
LKVDGTAIIG 
ETDGGLLASL 
AVEQGGSNLE 
AAAVQHANAA 
TGLRVIAQTQ 
GRSTWSENSA 
GADEHAEGSV 
EKGSALGWSG 
YTVTGGFTGA 
GSKQYGNHSG 



RSMLCAGRDD 
AGYTGRGVEV 
PEDGGGKDIE 
GRPAGGIAPD 
GTTSRAGTAD 
YHIRNKNMLF 
KFKREMYGEP 
TSFSAPIVTG 
LLDAGKAMNG 
LQLHGNNTYT 
SDGIVYLADT 
GKLYMSARGK 
DSVEKTAGSE 
NLMVELDASE 
DGVRIFNSLA 
QDGGTWEQGG 
NAKTDSISLF 
NGTLMQLGAL 
NSLTEGTLVG 
TAATGKTGAR 
RVGVGYRFLE 



GSGGGGVAAD IGAGLADALT APLDHKDKGL QSLTLDQSVR KNEKLKLAAQ 
GAEKTYGNGD SLNTGKLKND KVSRFDFIRQ IEVDGQLITL ESGEFQVYKQ 
SHSALTAFQT EQIQDSEHSG KMVAKRQFRI GDIAGEHTSF DKLPEGGRAT 
YRGTAFGSDD AGGKLTYTID FAAKQGNGKI EHLKSPELNV DLAAADIKPD 
GKRHAVISGS VLYNQAEKGS YSLGIFGGKA QEVAGSAEVK TVNGIRHIGL 
AAKQLEHHHH HH* 
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2651 
2701 
2751 
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ATGACTTCTG 
CAGCAGAGCA 
AGAACGAAAT 
GTTGCGGTTA 
GCATACCGGA 
ACCTCAAACC 
GGTATCGTCG 
GTATGGCAGA 
CGTATATGCG 
GCTTCTTTCG 
TATCCGCCAC 
TTGGCGGGCG 
GCGACGCTAC 
GGTTGCAGCC 
GCATCGTCAA 
CTTTTCCAAA 
CTATTCCGGC 
GCGATTACGG 
ATCTTTTCGA 
ATTGCCATTT 
GCGTAGACCG 
GGTACAGAAC 
GTGGTGCCTG 
ACCCGATTCA 
ACGGCGGCTC 
GCGTACCACG 
ACAGCAAGTT 
CCCGCGTCCT 
CGATATTGCC 
TCAAAAAAGG 
GGCAAAACCA 
ATCGGATATG 
CATCCGGCGG 
GACCAATCCG 
GGACGGCAAA 
ACGGTACGGC 
GGGGCAGGCT 
CGCCAAAATC 
GCGGCCTGCT 
GGCGACACGC 
TTCGGCAGCG 
AGGGCGGCAG 
TCATCCGCAA 
TATGCCGGGC 
TACAGCATGC 
GCTACCGTCT 
CCGCCTGAAA 
GCGTCATCGC 
GTTGAAGGCA 
AACCGGCGAA 
CATGGAGCGA 
GCAGGCATAC 
CTCCTACGGA 
AACATGCGGA 
GGCGGTGTCA 
CGGTCTGCGC 
GTGCTTTGGG 
CTCGCGGGTC 



CGCCCGACTT 
ACAACAGCGA 
GTGCAAAGAC 
CAGACAGGGA 
GACTTTCCAA 
TGCAATTGAA 
ACACAGGCGA 
AAAGAACACG 
GAAGGAAGCG 
ACGATGAGGC 
GTAAAAGAAA 
TTCCGTGGAC 
ACATAATGAA 
ATCCGCAATG 
TAACAGTTTT 
TAGCCAATTC 
GGTGATAAAA 
CAACCTGTCC 
CAGGCAATGA 
T AT G AAAAAG 
CAGTGGAGAA 
CGCTTGAGTA 
TCGGCACCCT 
AATTGCCGGA 
TGCTGCTGCA 
TTGCTGACGA 
CGGCTGGGGA 
TTCCGTTCGG 
TACTCCTTCC 
CGGCAGCCAA 
TTATCGAAGG 
CGCGTCGAAA 
CAGCCTGAAC 
GCGCAAACGA 
GGTACGCTGT 
GATTATCGGC 
ATCTCAACAG 
GGGCAGGATT 
GGCTTCCCTC 
TGTCCTATTA 
GCACATTCCG 
CAATCTGGAA 
CACCCGAGAC 
ATCCGCCCCT 
GAATGCCGCC 
ATGCCGACAG 
GCCGTATCGG 
GCAAACCCAA 
AAATGCGCGG 
AATACGACAG 
AAACAGTGCA 
GGCACGATGC 
CGCTACAAAA 
AGGCAGCGTC 
ACGTTCCGTT 
TACGACCTGC 
CTGGAGCGGC 
TGAAGCTGTC 



CAATGCAGGC 
AATCAGCAGC 
AGAAGCATGC 
TGCCAAAATC 
ACCCAAATGA 
GCAGGCTATA 
ATCCGTCGGC 
GCTATAACGA 
CCTGAAGACG 
CGTTATAGAG 
TCGGACACAT 
GGCAGACCTG 
TACGAATGAT 
CATGGGTCAA 
GGAACAACAT 
G GAG GAG C AG 
CAGACGAGGG 
TACCACATCC 
CGCACAAGCT 
ACGCTCAAAA 
AAGTTCAAAC 
TGGCTCCAAC 
ATGAAGCAAG 
ACATCCTTTT 
GAAATACCCG 
CGGCTCAGGA 
CTGCTGGATG 
CGACTTTACC 
GTAACGACAT 
CTGCAACTGC 
CGGTTCGCTG 
CCAAAGGTGC 
AGCGACGGCA 
AACCGTACAC 
ACACACGTTT 
GGCAAGCTGT 
TACCGGACGA 
ATTCTTTCTT 
GACAGCGTCG 
TGTCCGTCGC 
CGCCCGCCGG 
AACCTGATGG 
GGTTGAAACT 
ACGGCGCAAC 
GACGGTGTAC 
TACCGCCGCC 
ACGGGTTGGA 
CAGGACGGTG 
CAGTACCCAA 
CAGCCGCCAC 
AATGCAAAAA 
GGGCGATATC 
ACAGCATCAG 
AACGGCACGC 
TGCCGCAACG 
TCAAACAGGA 
AACAGCCTCA 
GCAACCCTTG 



GGTACCGGTA 
AGTATCTTAC 
TCTGTGCCGG 
AATGCCCCCC 
CGCATACAAG 
CAGGACGCGG 
AGCATATCCT 
AAATTACAAA 
GAGGCGGTAA 
ACTGAAGCAA 
CGATTTGGTC 
CAGGCGGTAT 
GAAACCAAGA 
GCTGGGCGAA 
CGAGGGCAGG 
TACCGCCAAG 
TATCCGCCTG 
GTAATAAAAA 
CAGCCCAACA 
AGGCATTATC 
GGGAAATGTA 
CATTGCGGAA 
CGTCCGTTTC 
CCGCACCCAT 
TGGATGAGCA 
CATCGGTGCA 
CGGGTAAGGC 
GCCGATACGA 
TTCAGGCACG 
ACGGCAACAA 
GTGTTGTACG 
GCTGATTTAT 
TTGTCTATCT 
ATCAAAGGCA 
GGGCAAACTG 
ACATGTCGGC 
CGTGTTCCCT 
C AC AAAC AT C 
AAAAAACAGC 
GGCAATGCGG 
TCTGAAACAC 
TCGAACTGGA 
GCGGCAGCCG 
TTTCCGCGCA 
GCATCTTCAA 
CATGCCGATA 
CCACAACGGC 
GAACGTGGGA 
ACCGTCGGCA 
ACTGGGCATG 
CCGACAGCAT 
GGCTATCTCA 
CCGCAGCACC 
TGATGCAGCT 
GGAGATTTGA 
TGCATTCGCC 
CTGAAGGCAC 
AGCGATAAAG 



T CGGC AGCAA 
GCCGGTATCA 
TCGGGATGAC 
CCCCGAATCT 
AATTTGATCA 
GGTAGAGGTA 
TTCCCGAACT 
AACTATACGG 
AGACATTGAA 
AGCCGACGGA 
TCCCATATTA 
TGCGCCCGAT 
ACGAAATGAT 
CGTGGCGTGC 
CACTGCCGAC 
CGTTGCTCGA 
ATGCAACAGA 
CATGCTTTTC 
CATATGCCCT 
ACAGTCGCAG 
TGGAGAACCG 
TTACTGCCAT 
ACCCGTACAA 
CGTAACCGGC 
ACGACAACCT 
GTCGGCGTGG 
CATGAACGGA 
AAGGTACATC 
GGCGGCCTGA 
CACCTATACG 
GCAACAACAA 
AACGGGGCGG 
GGCAGATACC 
GTCTGCAGCT 
CTGAAAGTGG 
ACGCGGCAAG 
TCCTGAGTGC 
GAAACCGACG 
GGGCAGTGAA 
CACGGACTGC 
GCCGTAGAAC 
TGCCTCCGAA 
ACCGCACAGA 
GCGGCAGCCG 
CAGTCTCGCC 
TGCAGGGACG 
ACGGGTCTGC 
ACAGGGCGGT 
TTGCCGCGAA 
GGACGCAGCA 
TAGTCTGTTT 
AAGGCCTGTT 
GGTGCGGACG 
GGGCGCACTG 
CGGTCGAAGG 
GAAAAAGGCA 
GCTGGTCGGA 
CCGTCCTGTT 
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TGCAACGGCG 
CGGGCGGCTT 
AATATGCCGC 
CGGCAACGGC 
AGTACGGCAA 
GGTGGCGGAG 
TGCCACTGTG 
GTTTCAAAGC 
ACCAAAAAAG 
TCTGGGTCTG 
ACAAACAAAA 
AAGTTAACAA 
TGCCGCTCTG 
TAACGACATT 
AAATTAGAAG 
CGATATCGCC 
TCAAAACCGC 
GTCGATGCCA 
TGCCGCTGGC 
CAAAAGTTAC 
GCTAAAAAAG 
CAAATTTGTC 
CACGCTTGGC 
AACGGTTTGG 
CCTTGCAGAA 
GTCGGTTCAA 
GTCGCCATCG 
AGGCGTGGCA 
GCGTCAATTA 



GGCGTGGAAC 
TACCGGCGCG 
ACACCCGTCT 
TGGAACGGCT 
CCACAGCGGA 
GCACTGGATC 
GCCATTGCTG 
TGGAGAGACC 
ACGCAACTGC 
AAAAAAGTCG 
CGTCGATGCC 
CCAAGTTAGC 
GATGCAACCA 
TGCTGAAGAG 
CCGTGGCTGA 
GATTCATTGG 
CAATGAAGCC 
AAGTAAAAGC 
ACAGCTAATA 
CG AC AT C AAA 
CAAACAGTGC 
AGAATTGATG 
TTCTGCTGAA 
AT AAAAC AG T 
CAAGCCGCGC 
TGTAACGGCT 
GTACCGGCTT 
GTCGGCACTT 
CGAGTGGCTC 



GCGACCTGAA 
ACTGCAGCAA 
GGTTGCCGGC 
TGGCACGTTA 
CGAGTCGGCG 
CGCCACAAAC 
CTGCCTACAA 
AT CT AC G AC A 
AGCCGATGTT 
TGACTAACCT 
AAAGTAAAAG 
AGACACTGAT 
CCAACGCCTT 
ACTAAGACAA 
TACCGTCGAC 
ATGAAACCAA 
AAACAGACGG 
TGCAGAAACT 
CTGCAGCCGA 
GCTGATATCG 
CGACGTGTAC 
GTCTGAACGC 
AAATCCATTG 
GTCAGACCTG 
TCTCCGGTCT 
GCAGTCGGCG 
CCGCTTTACC 
CGTCCGGTTC 
GAGCACCACC 



CGGACGCGAC 
CCGGCAAGAC 
CTGGGCGCGG 
CAGCTACGCC 
TAGGCTACCG 
GACGACGATG 
CAATGGCCAA 
TTGATGAAGA 
GAAGCCGACG 
GACCAAAACC 
CTGCAGAATC 
GCCGCTTTAG 
GAATAAATTG 
ATATCGTAAA 
AAGCATGCCG 
CACTAAGGCA 
CCGAAGAAAC 
GCAGCAGGCA 
CAAGGCCGAA 
CTACGAACAA 
ACCAGAGAAG 
TACTACCGAA 
CCGATCACGA 
CGCAAAGAAA 
GTTCCAACCT 
GCTACAAATC 
GAAAACTTTG 
TTCCGCAGCC 
ACCACCACCA 



TACACGGTAA 
GGGGGCACGC 
ATGTCGAATT 
GGTTCCAAAC 
GTTCCTCGAG 
TTAAAAAAGC 
GAAATCAACG 
CGGCACAATT 
ACTTTAAAGG 
GTCAATGAAA 
TGAAATAGAA 
CAGATACTGA 
GG AG AAAAT A 
AATTGATGAA 
AAGCATTCAA 
GACGAAGCCG 
CAAACAAAAC 
AAGCCGAAGC 
GCTGTCGCTG 
AGATAATATT 
AGTCTGACAG 
AAATT GGAC A 
TACTCGCCTG 
CCCGCCAAGG 
TACAACGTGG 
CGAATCGGCA 
CCGCCAAAGC 
TACCATGTCG 
CTGA 
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1301 
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1401 



MTSAPDFNAG 
VAVTDRDAKI 
GIVDTGESVG 
ASFDDEAVIE 
ATLHIMNTND 
LFQIANSEEQ 
IFSTGNDAQA 
GTEPLEYGSN 
TAALLLQKYP 
PASFPFGDFT 
GKTIIEGGSL 
DQSGANETVH 
GAGYLNSTGR 
GDTLSYYVRR 
SSATPETVET 
ATVYADSTAA 
VEGKMRGSTQ 
AGIRHDAGDI 
GGVNVPFAAT 
LAGLKLSQPL 
NMPHTRLVAG 
GGGGTGSATN 
TKKDATAADV 
KLTTKLADTD 
KLEAVADTVD 
VDAKVKAAET 
AKKANSADVY 
NGLDKTVSDL 
VAIGTGFRFT 



GTGIGSNSRA 
NAPPPNLHTG 
SISFPELYGR 
TEAKPTDIRH 
ETKNEMMVAA 
YRQALLDYSG 
QPNTYALLPF 
HCGITAMWCL 
WMSNDNLRTT 
ADTKGTSDIA 
VLYGNNKSDM 
IKGSLQLDGK 
RVPFLSAAKI 
GNAARTASAA 
AAADRTDMPG 
HADMQGRRLK 
TVGIAAKTGE 
GYLKGLFSYG 
GDLTVEGGLR 
SDKAVLFATA 
LGADVEFGNG 
D D DVKKAATV 
EADDFKGLGL 
AALADTDAAL 
KHAEAFNDIA 
AAGKAEAAAG 
TREESDSKFV 
RKETRQGLAE 
EN FAAKAGVA 



TTAKSAAVSY 
DFPNPN DAYK 
KEHGYNENYK 
VKEIGHIDLV 
I RNAWVKLGE 
GDKTDEGIRL 
YEKDAQKGII 
SAPYEASVRF 
LLTTAQDIGA 
YSFRNDISGT 
RVETKGALIY 
GTLYTRLGKL 
GQDYSFFTNI 
AHSAPAGLKH 
IRPYGATFRA 
AVSDGLDHNG 
NTTAAATLGM 
RYKNSISRST 
YDLLKQDAFA 
GVERDLNGRD 
WNGLARYSYA 
AIAAAYNNGQ 
KKVVTNLTKT 
DATTNALNKL 
DSLDETNTKA 
TANTAADKAE 
RIDGLNATTE 
QAALSGLFQP 
VGTSSGSSAA 



AGIKNEMCKD 
NLINLKPAIE 
NYTAYMRKEA 
SHIIGGRSVD 
RGVRIVNNSF 
MQQSDYGNLS 
TVAGVDRSGE 
TRTNPIQIAG 
VGVDSKFGWG 
GGLIKKGGSQ 
NGAASGGSLN 
LKVDGTAIIG 
ETDGGLLASL 
AVEQGGSNLE 
AAAVQHANAA 
TGLRVIAQTQ 
GRSTWSENSA 
GADEHAEGSV 
EKGSALGWSG 
YTVTGGFTGA 
GSKQYGNHSG 
EINGFKAGET 
VNENKQNVDA 
GENITTFAEE 
DEAVKTANEA 
A VAAKVT D I K 
KLDTRLASAE 
YNVGRFNVTA 
YHVGVNYEWL 



RSMLCAGRDD 
AGYTGRGVEV 
PEDGGGKDIE 
GRPAGGIAPD 
GTTSRAGTAD 
YHIRNKNMLF 
KFKREMYGEP 
TSFSAPIVTG 
LLDAGKAMNG 
LQLHGNNTYT 
SDGIVYLADT 
GKLYMSARGK 
DSVEKTAGSE 
NLMVELDASE 
DGVRIFNSLA 
QDGGTWEQGG 
NAKTDSISLF 
NGTLMQLGAL 
NSLTEGTLVG 
TAATGKTGAR 
RVGVGYRFLE 
I YDIDEDGTI 
KVKAAESEIE 
TKTNIVKIDE 
KQTAEETKQN 
ADIATNKDNI 
KSIADHDTRL 
AVGGYKSESA 
EHHHHHH* 
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-961c 

ATGACTTCTG 
CAGCAGAGCA 
AGAACGAAAT 
GTTGCGGTTA 
GCATACCGGA 



CGCCCGACTT 
ACAACAGCGA 
GTGCAAAGAC 
CAGACAGGGA 
GACTTTCCAA 



CAATGCAGGC 
AATCAGCAGC 
AGAAGCATGC 
TGCCAAAATC 
ACCCAAATGA 



GGTACCGGTA 
AGTATCTTAC 
TCTGTGCCGG 
AATGCCCCCC 
CGCATACAAG 



TCGGCAGCAA 
GCCGGTATCA 
TCGGGATGAC 
CCCCGAATCT 
AATTTGATCA 
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3551 



ACCTCAAACC 
GGTATCGTCG 
GTATGGCAGA 
CGTATATGCG 
GCTTCTTTCG 
TATCCGCCAC 
TTGGCGGGCG 
GCGACGCTAC 
GGTTGCAGCC 
GCATCGTCAA 
CTTTTCCAAA 
CTATTCCGGC 
GCGATTACGG 
ATCTTTTCGA 
ATTGCCATTT 
GCGTAGACCG 
GGTACAGAAC 
GTGGTGCCTG 
ACCCGATTCA 
ACGGCGGCTC 
GCGTACCACG 
ACAGCAAGTT 
CCCGCGTCCT 
CGATATTGCC 
TCAAAAAAGG 
GGCAAAACCA 
ATCGGATATG 
CATCCGGCGG 
GACCAATCCG 
GGACGGCAAA 
ACGGTACGGC 
GGGGCAGGCT 
CGCCAAAATC 
GCGGCCTGCT 
GGCGACACGC 
TTCGGCAGCG 
AGGGCGGCAG 
TCATCCGCAA 
TATGCCGGGC 
TACAGCATGC 
GCTACCGTCT 
CCGCCTGAAA 
GCGTCATCGC 
GTTGAAGGCA 
AACCGGCGAA 
CATGGAGCGA 
GCAGGC AT AC 
CTCCTACGGA 
AACATGCGGA 
GGCGGTGTCA 
CGGTCTGCGC 
GTGCTTTGGG 
CTCGCGGGTC 
TGCAACGGCG 
CGGGCGGCTT 
AATATGCCGC 
CGGCAACGGC 
AGTACGGCAA 
GGTGGCGGAG 
TGCCACTGTG 
GTTTCAAAGC 
ACCAAAAAAG 
TCTGGGTCTG 
ACAAACAAAA 
AAGTTAACAA 
TGCCGCTCTG 
TAACGACATT 



TGCAATTGAA 
ACACAGGCGA 
AAAGAACACG 
GAAGGAAGCG 
ACGATGAGGC 
GTAAAAGAAA 
TTCCGTGGAC 
ACATAATGAA 
ATCCGCAATG 
TAACAGTTTT 
TAGCCAATTC 
GGTGATAAAA 
CAACCTGTCC 
CAGGCAATGA 
TAT GAAAAAG 
CAGTGGAGAA 
CGCTTGAGTA 
TCGGCACCCT 
AATTGCCGGA 
TGCTGCTGCA 
TTGCTGACGA 
CGGCTGGGGA 
TTCCGTTCGG 
TACTCCTTCC 
CGGCAGCCAA 
TTATCGAAGG 
CGCGTCGAAA 
CAGCCTGAAC 
GCGCAAACGA 
GGTACGCTGT 
GATTATCGGC 
ATCTCAACAG 
GGGCAGGATT 
GGCTTCCCTC 
TGTCCTATTA 
GCACATTCCG 
CAATCTGGAA 
CACCCGAGAC 
ATCCGCCCCT 
GAATGCCGCC 
ATGCCGACAG 
GCCGTATCGG 
GCAAACCCAA 
AAATGCGCGG 
AATACGACAG 
AAACAGTGCA 
GGCACGATGC 
CGCTACAAAA 
AGGCAGCGTC 
ACGTTCCGTT 
TACGACCTGC 
CTGGAGCGGC 
TGAAGCTGTC 
GGCGTGGAAC 
TACCGGCGCG 
ACACCCGTCT 
TGGAACGGCT 
CCACAGCGGA 
GCACTGGATC 
GCCATTGCTG 
TGGAGAGACC 
ACGCAACTGC 
AAAAAAGTCG 
CGTCGATGCC 
CCAAGTTAGC 
GATGCAACCA 
TGCTGAAGAG 



GCAGGCTATA 
ATCCGTCGGC 
GCTATAACGA 
CCTGAAGACG 
CGTTATAGAG 
TCGGACACAT 
GGCAGACCTG 
TACGAATGAT 
CATGGGTCAA 
GGAACAACAT 
GGAGGAGCAG 
CAGACGAGGG 
TACCACATCC 
CGCACAAGCT 
ACGCT CAAAA 
AAGTTCAAAC 
TGGCTCCAAC 
ATGAAGCAAG 
ACATCCTTTT 
GAAATACCCG 
CGGCTCAGGA 
CTGCTGGATG 
CGACTTTACC 
GTAACGACAT 
CTGCAACTGC 
CGGTTCGCTG 
CCAAAGGTGC 
AGCGACGGCA 
AACCGTACAC 
ACACACGTTT 
GGCAAGCTGT 
TACCGGACGA 
ATTCTTTCTT 
GACAGCGTCG 
TGTCCGTCGC 
CGCCCGCCGG 
AACCTGATGG 
GGTTGAAACT 
ACGGCGCAAC 
GACGGTGTAC 
TACCGCCGCC 
ACGGGTTGGA 
CAGGACGGTG 
CAGTACCCAA 
CAGCCGCCAC 
AATGCAAAAA 
GGGCGATATC 
ACAGCATCAG 
AACGGCACGC 
TGCCGCAACG 
TCAAACAGGA 
AACAGCCTCA 
GCAACCCTTG 
GCGACCTGAA 
ACTGCAGCAA 
GGTTGCCGGC 
TGGCACGTTA 
CGAGTCGGCG 
CGCCACAAAC 
CTGCCTACAA 
ATCTACGACA 
AGCCGATGTT 
TGACTAACCT 
AAAGTAAAAG 
AGACACTGAT 
CCAACGCCTT 
ACTAAGACAA 



CAGGACGCGG 
AGCATATCCT 
AAATTACAAA 
GAGGCGGTAA 
ACTGAAGCAA 
CGATTTGGTC 
CAGGCGGTAT 
GAAACCAAGA 
GCTGGGCGAA 
CGAGGGCAGG 
TACCGCCAAG 
TATCCGCCTG 
GTAATAAAAA 
CAGCCCAACA 
AGGCATTATC 
GGGAAATGTA 
CATTGCGGAA 
CGTCCGTTTC 
CCGCACCCAT 
TGGATGAGCA 
CATCGGTGCA 
CGGGTAAGGC 
GCCGATACGA 
TTCAGGCACG 
ACGGCAACAA 
GTGTTGTACG 
GCTGATTTAT 
TTGTCTATCT 
ATCAAAGGCA 
GGGCAAACTG 
ACATGTCGGC 
CGTGTTCCCT 
CACAAACATC 
AAAAAACAGC 
GGCAATGCGG 
TCTGAAACAC 
TCGAACTGGA 
GCGGCAGCCG 
TTTCCGCGCA 
GCATCTTCAA 
CATGCCGATA 
CCACAACGGC 
GAACGTGGGA 
ACCGTCGGCA 
ACTGGGCATG 
CCGACAGCAT 
GGCTATCTCA 
CCGCAGCACC 
TGATGCAGCT 
GGAGATTTGA 
TGCATTCGCC 
CTGAAGGCAC 
AGCGATAAAG 
CGGACGCGAC 
CCGGCAAGAC 
CTGGGCGCGG 
CAGCTACGCC 
TAGGCTACCG 
GACGACGATG 
CAATGGCCAA 
TTGATGAAGA 
GAAGCCGACG 
GACCAAAACC 
CTGCAGAATC 
GCCGCTTTAG 
GAATAAATTG 
ATATCGTAAA 



GGTAGAGGTA 
TTCCCGAACT 
AACTATACGG 
AGACATTGAA 
AGCCGACGGA 
TCCCATATTA 
TGCGCCCGAT 
ACGAAATGAT 
CGTGGCGTGC 
CACTGCCGAC 
CGTTGCTCGA 
ATGCAACAGA 
CATGCTTTTC 
CATATGCCCT 
ACAGTCGCAG 
TGGAGAACCG 
TTACTGCCAT 
ACCCGTACAA 
CGTAACCGGC 
ACGACAACCT 
GTCGGCGTGG 
CATGAACGGA 
AAGGTACATC 
GGCGGCCTGA 
CACCTATACG 
GCAACAACAA 
AACGGGGCGG 
GGCAGATACC 
GTCTGCAGCT 
CTGAAAGTGG 
ACGCGGCAAG 
TCCTGAGTGC 
GAAACCGACG 
GGGCAGTGAA 
CACGGACTGC 
GCCGTAGAAC 
TGCCTCCGAA 
ACCGCACAGA 
GCGGCAGCCG 
CAGTCTCGCC 
TGCAGGGACG 
ACGGGTCTGC 
ACAGGGCGGT 
TTGCCGCGAA 
GGACGCAGCA 
TAGTCTGTTT 
AAGGCCTGTT 
GGTGCGGACG 
GGGCGCACTG 
CGGTCGAAGG 
GAAAAAG G C A 
GCTGGTCGGA 
CCGTCCTGTT 
TACACGGTAA 
GGGGGCACGC 
ATGTCGAATT 
GGTTCCAAAC 
GTTCCTCGAG 
TTAAAAAAGC 
GAAATCAACG 
CGGCACAATT 
ACTTTAAAGG 
GTCAATGAAA 
TGAAATAGAA 
CAGATACTGA 
GGAGAAAATA 
AATTGATGAA 
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3601 AAATTAGAAG CCGTGGCTGA TACCGTCGAC AAGCATGCCG AAGCATTCAA 

3 651 CGATATCGCC GATTCATTGG ATGAAACCAA CACTAAGGCA GACGAAGCCG 

3701 TCAAAACCGC CAATGAAGCC AAACAGACGG CCGAAGAAAC CAAACAAAAC 

37 51 GTCGATGCCA AAGTAAAAGC TGCAGAAACT GCAGCAGGCA AAGCCGAAGC 

3801 TGCCGCTGGC ACAGCTAATA CTGCAGCCGA CAAGGCCGAA GCTGTCGCTG 

3851 CAAAAGTTAC CGACATCAAA GCTGATATCG CTACGAACAA AGATAATATT 

3901 GCTAAAAAAG CAAACAGTGC CGACGTGTAC ACCAGAGAAG AGTCTGACAG 

3 951 CAAATTTGTC AGAATTGATG GTCTGAACGC TACTACCGAA AAATTGGACA 

4 001 CACGCTTGGC TTCTGCTGAA AAATCCATTG CCGATCACGA TACTCGCCTG 
4 051 AACGGTTTGG ATAAAACAGT GTCAGACCTG CGCAAAGAAA CCCGCCAAGG 
4101 CCTTGCAGAA CAAGCCGCGC TCTCCGGTCT GTTCCAACCT TACAACGTGG 
4151 GTCTCGAGCA CCACCACCAC CACCACTGA 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 



MTSAPDFNAG 
VAVTDRDAKI 
GIVDTGESVG 
ASFDDEAVIE 
ATLHIMNTND 
LFQIANSEEQ 
IFSTGNDAQA 
GTEPLEYGSN 
TAALLLQKYP 
PASFPFGDFT 
GKTIIEGGSL 
DQSGANETVH 
GAGYLNSTGR 
GDTLSYYVRR 
SSATPETVET 
ATVYADSTAA 
VEGKMRGSTQ 
AGIRHDAGDI 
GGVNVPFAAT 
LAGLKLSQPL 
NMPHTRLVAG 
GGGGTGSATN 
TKKDATAADV 
KLTTKLADTD 
KLEAVADTVD 
VDAKVKAAET 
AKKANSADVY 
NGLDKTVSDL 



GTGIGSNSRA 
NAPPPNLHTG 
SISFPELYGR 
TEAKPTDIRH 
ETKNEMMVAA 
YRQALLDYSG 
QPNTYALLPF 
HCGITAMWCL 
WMSNDNLRTT 
ADTKGTSDIA 
VLYGNNKSDM 
IKGSLQLDGK 
RVPFLSAAKI 
GNAARTASAA 
AAADRT DMPG 
HADMQGRRLK 
TVGIAAKTGE 
GYLKGLFSYG 
GDLTVEGGLR 
SDKAVLFATA 
LGADVE FGNG 
D D DVKKAAT V 
EADDFKGLGL 
AALADTDAAL 
KHAEAFNDIA 
AAGKAEAAAG 
TREESDSKFV 
RKETRQGLAE 



TTAKSAAVSY 
DFPNPN DAYK 
KEHGYNENYK 
VKEIGHIDLV 
IRNAWVKLGE 
GDKTDEGIRL 
YEKDAQKGII 
SAPYEASVRF 
LLTTAQDIGA 
YSFRNDISGT 
RVETKGALIY 
GTLYTRLGKL 
GQDYSFFTNI 
AHSAPAGLKH 
IRPYGAT FRA 
AVSDGLDHNG 
NTTAAATLGM 
RYKNSISRST 
YDLLKQDAFA 
GVERDLNGRD 
WNGLARYSYA 
AIAAAYNNGQ 
KKVVTNLTKT 
DATTNALNKL 
DSLDETNTKA 
TANTAADKAE 
RIDGLNATTE 
QAALSGLFQP 



AGIKNEMCKD 
NLINLKPAIE 
NYTAYMRKEA 
SHIIGGRSVD 
RGVRIVNNSF 
MQQSDYGNLS 
TVAGVDRSGE 
TRTNPIQIAG 
VGVDSKFGWG 
GGLIKKGGSQ 
NGAASGGSLN 
LKVDGTAIIG 
ETDGGLLASL 
AVEQGGSNLE 
AAAVQHANAA 
TGLRVIAQTQ 
GRSTWSENSA 
GADEHAEGSV 
EKGSALGWSG 
YTVTGGFTGA 
GSKQYGNHSG 
EINGFKAGET 
VNENKQNVDA 
GENITTFAEE 
DEAVKTANEA 
AVAAKVTDIK 
KLDTRLASAE 
YNVGLEHHHH 



RSMLCAGRDD 
AGYTGRGVEV 
PEDGGGKDIE 
GRPAGGIAPD 
GTTSRAGTAD 
YHIRNKNMLF 
KFKREMYGEP 
TSFSAPIVTG 
LLDAGKAMNG 
LQLHGNNTYT 
SDGIVYLADT 
GKLYMSARGK 
DSVEKTAGSE 
NLMVELDASE 
DGVRIFNSLA 
QDGGTWEQGG 
NAKTDSISLF 
NGTLMQLGAL 
NSLTEGTLVG 
TAATGKTGAR 
RVGVGYRFLE 
IYDIDEDGTI 
KVKAAESEIE 
TKTNIVKIDE 
KQTAEETKQN 
ADIATNKDNI 
KSIADHDTRL 
HH* 



AG741 and hybrids 

[0225] Bactericidal titres generated in response to AG741 (His-fusion) were measured against variousstrains, including 
the homologous 2996 strain: 





2996 


MC58 


NGH38 


F6124 


BZ133 


AG741 


512 


131072 


>2048 


16384 


>2048 



[0226] As can be seen, the AG741 -induced anti-bactericidal titre is particularly high against heterologous strain MC58. 
[0227] AG741 was also fused directly in-frame upstream of proteins 961 , 961c, 983 and ORF46.1 : 
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961 

ATGGTCGCCG 
GCTCGACCAT 
TCAGGAAAAA 
TATGGAAACG 
CAGCCGTTTC 



CCGACATCGG 
AAAGACAAAG 
CGAGAAACTG 
GTGACAGCCT 
GACTTTATCC 



TGCGGGGCTT 
GTTTGCAGTC 
AAGCTGGCGG 
CAATACGGGC 
GCCAAATCGA 



GCCGATGCAC 
TTTGACGCTG 
CACAAGGTGC 
AAATTGAAGA 
AGTGGACGGG 



TAACCGCACC 
GATCAGTCCG 
GGAAAAAACT 
ACGACAAGGT 
CAGCTCATTA 



CCTTGGAGAG 
ACCGCCTTTC 
GGTTGCGAAA 
CTTTTGACAA 
TTCGGTTCAG 
CGCCAAGCAG 
ATGTCGACCT 
GTCATCAGCG 
CCTCGGTATC 
TGAAAACCGT 
GAGGGTGGCG 
AGCTGCCACT 
ACGGTTTCAA 
ATTACCAAAA 
AGGTCTGGGT 
AAAACAAACA 
GAAAAGT TAA 
TGATGCCGCT 
ATATAACGAC 
G AAAAAT TAG 
CAACGATATC 
CCGTCAAAAC 
AACGTCGATG 
AGCTGCCGCT 
CTGCAAAAGT 
ATTGCTAAAA 
CAGCAAATTT 
ACACACGCTT 
CTGAACGGTT 
AGGCCTTGCA 
TGGGTCGGTT 
GCAGTCGCCA 
AGCAGGCGTG 
TCGGCGTCAA 



TGGAGAGTTC 
AGACCGAGCA 
CGCCAGTTCA 
GCTTCCCGAA 
ACGATGCCGG 
GGAAACGGCA 
GGCCGCCGCC 
GTTCCGTCCT 
TTTGGCGGAA 
AAACGGCATA 
GAGGCACTGG 
GTGGCCATTG 
AGCTGGAGAG 
AAGACGCAAC 
CTGAAAAAAG 
AAACGTCGAT 
CAACCAAGTT 
CTGGATGCAA 
ATTTGCTGAA 
AAGCCGTGGC 
GCCGATTCAT 
CGCCAATGAA 
CCAAAGTAAA 
GGCACAGCTA 
TACCGACATC 
AAGCAAACAG 
GTCAGAATTG 
GGCTTCTGCT 
TGGATAAAAC 
GAACAAGCCG 
CAATGTAACG 
TCGGTACCGG 
GCAGTCGGCA 
TTACGAGTGG 



CAAGTATACA 
AATACAAGAT 
GAATCGGCGA 
GGCGGCAGGG 
CGGAAAACTG 
AAAT CGAACA 
GATATCAAGC 
TTACAACCAA 
AAGCCCAGGA 
CGCCATATCG 
ATCCGCCACA 
CTGCTGCCTA 
ACCATCTACG 
TGCAGCCGAT 
TCGTGACTAA 
GCCAAAGTAA 
AGCAGACACT 
CCACCAACGC 
GAGACTAAGA 
TGATACCGTC 
TGGATGAAAC 
GCCAAACAGA 
AGCTGCAGAA 
ATACTGCAGC 
AAAGCTGATA 
TGCCGACGTG 
ATGGTCTGAA 
G AAAAAT C C A 
AGTGTCAGAC 
CGCTCTCCGG 
GCTGCAGTCG 
CTTCCGCTTT 
CTTCGTCCGG 
CTCGAGCACC 



AACAAAGCCA 
TCGGAGCATT 
CATAGCGGGC 
CGACATATCG 
ACCTACACCA 
TTTGAAATCG 
CGGATGGAAA 
GCCGAGAAAG 
AGTTGCCGGC 
GCCTTGCCGC 
AACGACGACG 
CAACAATGGC 
ACATTGATGA 
GTTGAAGCCG 
CCTGACCAAA 
AAGCTGCAGA 
GATGCCGCTT 
CTTGAATAAA 
CAAATATCGT 
GACAAGCATG 
CAACACTAAG 
CGGCCGAAGA 
ACTGCAGCAG 
CGACAAGGCC 
TCGCTACGAA 
TACACCAGAG 
CGCTACTACC 
TTGCCGATCA 
CTGCGCAAAG 
TCTGTTCCAA 
GCGGCTACAA 
ACCGAAAACT 
TTCTTCCGCA 
ACCACCACCA 



TTCCGCCTTA 
CCGGGAAGAT 
GAACATACAT 
CGGGACGGCG 
TAGATTTCGC 
CCAGAACTCA 
ACGCCATGCC 
GCAGTTACTC 
AGCGCGGAAG 
CAAGCAACTC 
ATGTTAAAAA 
C AAGAAAT C A 
AGACGGCACA 
ACGACTTTAA 
ACCGTCAATG 
ATCTGAAATA 
T AG C AG AT AC 
T T G GG AG AAA 
AAAAAT T GAT 
CCGAAGCATT 
GCAGACGAAG 
AACCAAACAA 
GCAAAGCCGA 
GAAGCTGTCG 
CAAAGATAAT 
AAGAGTCTGA 
GAAAAATTGG 
CGATACTCGC 
AAACCCGCCA 
CCTTACAACG 
ATCCGAATCG 
TTGCCGCCAA 
GCCTACCATG 
CCACTGA 



MVAADIGAGL 
YGNGDSLNTG 
TAFQTEQIQD 
FGSDDAGGKL 
VISGSVLYNQ 
EGGGGTGSAT 
ITKKDATAAD 
EKLTTKLADT 
EKLEAVADTV 
NVDAKVKAAE 
IAKKANSADV 
LNGLDKTVSD 
AVAIGTGFRF 



ADALTAPLDH 
KLKNDKVSRF 
SEHSGKMVAK 
TYTIDFAAKQ 
AEKGSYSLGI 
NDDDVKKAAT 
VEADDFKGLG 
DAALADT DAA 
DKHAEAFNDI 
TAAGKAEAAA 
YTREESDSKF 
LRKETRQGLA 
TENFAAKAGV 



KDKGLQSLTL 
DFIRQIEVDG 
RQFRIGDIAG 
GNGKIEHLKS 
FGGKAQEVAG 
VAIAAAYNNG 
LKKVVTN LTK 
LDATTNALNK 
ADSLDETNTK 
GTANTAADKA 
VRIDGLNATT 
EQAALSGLFQ 
AVGTSSGSSA 



DQSVRKNEKL 
QLITLESGEF 
EHTSFDKLPE 
PELNVDLAAA 
SAEVKTVNGI 
QEINGFKAGE 
TVNENKQNVD 
LGENITTFAE 
ADEAVKTANE 
EAVAAKVTDI 
EKLDTRLASA 
PYNVGRFNVT 
AYHVGVNYEW 



KLAAQGAEKT 
QVYKQSHSAL 
GGRATYRGTA 
DIKPDGKRHA 
RHIGLAAKQL 
TIYDIDEDGT 
AKVKAAESEI 
ETKTNIVKID 
AKQTAEETKQ 
KADIATNKDN 
EKSIADHDTR 
AAVGGYKSES 
LEHHHHHH* 
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AG741-961C 



10 



15 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 



ATGGTCGCCG 
GCTCGACCAT 
TCAGGAAAAA 
TATGGAAACG 
CAGCCGTTTC 
CCTTGGAGAG 
ACCGCCTTTC 
GGTTGCGAAA 
CTTTTGACAA 
TTCGGTTCAG 
CGCCAAGCAG 
ATGTCGACCT 
GTCATCAGCG 
CCTCGGTATC 
TGAAAACCGT 
GAGGGTGGCG 



CCGACATCGG 
AAAGACAAAG 
CGAGAAACTG 
GTGACAGCCT 
GACTTTATCC 
TGGAGAGTTC 
AGACCGAGCA 
CGCCAGTTCA 
GCTTCCCGAA 
ACGATGCCGG 
GGAAACGGCA 
GGCCGCCGCC 
GTTCCGTCCT 
TTTGGCGGAA 
AAACGGCATA 
GAGGCACTGG 



TGCGGGGCTT 
GTTTGCAGTC 
AAGCTGGCGG 
CAATACGGGC 
GCCAAATCGA 
CAAGTATACA 
AATACAAGAT 
GAATCGGCGA 
GGCGGCAGGG 
CGGAAAACTG 
AAATCGAACA 
GATATCAAGC 
TTACAACCAA 
AAGCCCAGGA 
CGCCATATCG 
ATCCGCCACA 



GCCGATGCAC 
TTTGACGCTG 
CACAAGGTGC 
AAATTGAAGA 
AGTGGACGGG 
AACAAAG CCA 
TCGGAGCATT 
CATAGCGGGC 
CGACATATCG 
ACCTACACCA 
TTTGAAATCG 
CGGATGGAAA 
GCCGAGAAAG 
AGTTGCCGGC 
GCCTTGCCGC 
AACGACGACG 



TAACCGCACC 
GATCAGTCCG 
GGAAAAAACT 
ACGACAAGGT 
CAGCTCATTA 
TTCCGCCTTA 
CCGGGAAGAT 
GAACATACAT 
CGGGACGGCG 
TAGATTTCGC 
CCAGAACTCA 
ACGCCATGCC 
GCAGTTACTC 
AGCGCGGAAG 
CAAGCAACTC 
ATGTTAAAAA 



20 



25 



30 



35 



801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



AGCTGCCACT 
ACGGTTTCAA 
ATTACCAAAA 
AGGTCTGGGT 
AAAACAAACA 
GAAAAGTTAA 
TGATGCCGCT 
AT AT AAC G AC 
GAAAAATTAG 
CAACGATATC 
CCGTCAAAAC 
AACGTCGATG 
AGCTGCCGCT 
CTGCAAAAGT 
ATTGCTAAAA 
CAGCAAATTT 
ACACACGCTT 
CTGAACGGTT 
AGGCCTTGCA 
TGGGTCTCGA 



GTGGCCATTG 
AGCTGGAGAG 
AAGACGCAAC 
CTGAAAAAAG 
AAACGTCGAT 
CAACCAAGTT 
CTGGATGCAA 
ATTTGCTGAA 
AAGCCGTGGC 
GCCGATTCAT 
CGCCAATGAA 
CCAAAGTAAA 
GGCACAGCTA 
TACCGACATC 
AAGCAAACAG 
GTCAGAATTG 
GGCTTCTGCT 
TGGATAAAAC 
GAACAAGCCG 
GCACCACCAC 



CTGCTGCCTA 
ACCATCTACG 
TGCAGCCGAT 
TCGTGACTAA 
GCCAAAGTAA 
AGCAGACACT 
CCACCAACGC 
GAGACTAAGA 
TGATACCGTC 
TGGATGAAAC 
GCCAAACAGA 
AGCTGCAGAA 
ATACTGCAGC 
AAAGCTGATA 
TGCCGACGTG 
ATGGTCTGAA 
GAAAAATCCA 
AGTGTCAGAC 
CGCTCTCCGG 
CACCACCACT 



CAACAATGGC 
ACATTGATGA 
GTTGAAGCCG 
CCTGACCAAA 
AAGCTGCAGA 
GATGCCGCTT 
CTTGAATAAA 
CAAATATCGT 
GACAAGCATG 
CAACACTAAG 
CGGCCGAAGA 
ACTGCAGCAG 
CGACAAGGCC 
TCGCTACGAA 
TACACCAGAG 
CGCTACTACC 
TTGCCGATCA 
CTGCGCAAAG 
TCTGTTCCAA 
GA 



CAAGAAATCA 
AGACGGCACA 
ACGACTTTAA 
ACCGTCAATG 
ATCTGAAATA 
TAGCAGATAC 
TTGGGAGAAA 
AAAAATTGAT 
CCGAAGCATT 
GCAGACGAAG 
AACCAAACAA 
GCAAAGCCGA 
GAAGCTGTCG 
CAAAGATAAT 
AAGAGTCTGA 
GAAAAATTGG 
CGATACTCGC 
AAACCCGCCA 
CCTTACAACG 



40 



45 



50 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 



MVAADIGAGL 
YGNGDSLNTG 
TAFQTEQIQD 
FGSDDAGGKL 
VISGSVLYNQ 
EGGGGTGSAT 
ITKKDATAAD 
EKLTTKLADT 
EKLEAVADTV 
NVDAKVKAAE 
IAKKANSADV 
LNGLDKTVSD 



ADALTAPLDH 
KLKNDKVSRF 
SEHSGKMVAK 
TYTIDFAAKQ 
AEKGSYSLGI 
NDDDVKKAAT 
VEADDFKGLG 
DAALADTDAA 
DKHAEAFNDI 
TAAGKAEAAA 
YTREESDSKF 
LRKETRQGLA 



KDKGLQSLTL 
DFIRQIEVDG 
RQFRIGDIAG 
GNGKIEHLKS 
FGGKAQEVAG 
VAIAAAYNNG 
LKKVVTNLTK 
LDATTNALNK 
ADSLDETNTK 
GTANTAADKA 
VRIDGLNATT 
EQAALSGLFQ 



DQSVRKNEKL 
QLITLESGEF 
EHTSFDKLPE 
PELNVDLAAA 
SAEVKTVNGI 
QEINGFKAGE 
TVNENKQNVD 
LGENITTFAE 
ADEAVKTANE 
E AVAAKVT D I 
EKLDTRLASA 
PYNVGLEHHH 



KLAAQGAEKT 
QVYKQSHSAL 
GGRATYRGTA 
DIKPDGKRHA 
RHIGLAAKQL 
TIYDIDEDGT 
AKVKAAESEI 
ETKTNIVKID 
AKQTAEETKQ 
KADIATNKDN 
EKSIADHDTR 
HHH* 
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983 

ATGGTCGCCG 
GCTCGACCAT 
TCAGGAAAAA 
TATGGAAACG 
CAGCCGTTTC 
CCTTGGAGAG 
ACCGCCTTTC 
GGTTGCGAAA 
CTTTTGACAA 
TTCGGTTCAG 
CGCCAAGCAG 
ATGTCGACCT 
GTCATCAGCG 
CCTCGGTATC 
TGAAAACCGT 
GAGGGATCCG 
TACCGGTATC 
TATCTTACGC 
TGTGCCGGTC 
TGCCCCCCCC 
CATACAAGAA 
GGACGCGGGG 
CATATCCTTT 
AT T AC AAAAA 
GGCGGTAAAG 
TGAAGCAAAG 
ATTTGGTCTC 
GGCGGTATTG 
AACCAAGAAC 
TGGGCGAACG 
AGGGCAGGCA 



CCGACATCGG 
AAAGACAAAG 
CGAGAAACTG 
GTGACAGCCT 
GACTTTATCC 
TGGAGAGTTC 
AG AC C GAG C A 
CGCCAGTTCA 
GCTTCCCGAA 
ACGATGCCGG 
GGAAACGGCA 
GGCCGCCGCC 
GTTCCGTCCT 
TTTGGCGGAA 
AAAC G G CAT A 
GCGGAGGCGG 
GGCAGCAACA 
CGGTATCAAG 
GGGATGACGT 
CCGAATCTGC 
TTTGATCAAC 
TAGAGGTAGG 
CCCGAACTGT 
CTATACGGCG 
ACATTGAAGC 
CCGACGGATA 
CCATATTATT 
CGCCCGATGC 
GAAATGATGG 
TGGCGTGCGC 
CTGCCGACCT 



TGCGGGGCTT 
GTTTGCAGTC 
AAGCTGGCGG 
CAATACGGGC 
GCCAAATCGA 
CAAGTATACA 
AATACAAGAT 
GAATCGGCGA 
GGCGGCAGGG 
CGGAAAACTG 
AAATCGAACA 
GAT AT CAAGC 
TTACAACCAA 
AAGCCCAGGA 
CGCCATATCG 
CACTTCTGCG 
GCAGAGCAAC 
AACGAAATGT 
TGCGGTTACA 
ATACCGGAGA 
CTCAAACCTG 
TATCGTCGAC 
ATGGCAGAAA 
TATATGCGGA 
TTCTTTCGAC 
TCCGCCACGT 
GGCGGGCGTT 
GACGCTACAC 
TTGCAGCCAT 
ATCGTCAATA 
TTTCCAAATA 



GCCGATGCAC 
TTTGACGCTG 
CACAAGGTGC 
AAATTGAAGA 
AGTGGACGGG 
AACAAAGCCA 
TCGGAGCATT 
CATAGCGGGC 
C G AC AT AT CG 
ACCTACACCA 
TTTGAAATCG 
CGGATGGAAA 
GCCGAGAAAG 
AGTTGCCGGC 
GCCTTGCCGC 
CCCGACTTCA 
AACAGCGAAA 
GCAAAGACAG 
GACAGGGATG 
CTTTCCAAAC 
CAATTGAAGC 
ACAGGCGAAT 
AGAACACGGC 
AGGAAGCGCC 
GATGAGGCCG 
AAAAGAAATC 
CCGTGGACGG 
ATAATGAATA 
CCGCAATGCA 
ACAGTTTTGG 
GCCAATTCGG 



TAACCGCACC 
GATCAGTCCG 
GGAAAAAACT 
ACGACAAGGT 
CAGCTCATTA 
TTCCGCCTTA 
CCGGGAAGAT 
GAACATACAT 
CGGGACGGCG 
TAGATTTCGC 
CCAGAACTCA 
ACGCCATGCC 
GCAGTTACTC 
AGCGCGGAAG 
CAAGCAACTC 
ATGCAGGCGG 
TCAGCAGCAG 
AAGCATGCTC 
CCAAAATCAA 
CCAAATGACG 
AGGCTATACA 
CCGTCGGCAG 
T AT AAC G AAA 
TGAAGACGGA 
TTATAGAGAC 
GG AC ACATCG 
CAGACCTGCA 
CGAATGATGA 
TGGGTCAAGC 
AACAACATCG 
AGGAGCAGTA 
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15 



20 



25 



30 



35 



40 



1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 
3451 
3501 
3551 
3601 
3651 
3701 
3751 
3801 
3851 
3901 



CCGCCAAGCG 
TCCGCCTGAT 
AATAAAAACA 
GCCCAACACA 
G CAT TAT C AC 
GAAATGTATG 
TTGCGGAATT 
TCCGTTTCAC 
GCACCCATCG 
GATGAGCAAC 
TCGGTGCAGT 
GGTAAGGCCA 
CGATACGAAA 
CAGGCACGGG 
GGCAACAACA 
GTTGTACGGC 
TGATTTATAA 
GTCTATCTGG 
CAAAGGCAGT 
GCAAACTGCT 
ATGTCGGCAC 
TGTTCCCTTC 
CAAACATCGA 
AAAACAGCGG 
CAATGCGGCA 
TGAAACACGC 
GAACTGGATG 
GGCAGCCGAC 
TCCGCGCAGC 
ATCTTCAACA 
TGCCGATATG 
ACAACGGCAC 
ACGTGGGAAC 
CGTCGGCATT 
TGGGCATGGG 
GACAGCATTA 
CTATCTCAAA 
GCAGCACCGG 
ATGCAGCTGG 
AGATTTGACG 
CATTCGCCGA 
GAAGGCACGC 
CGATAAAGCC 
GACGCGACTA 
GGCAAGACGG 
GGGCGCGGAT 
GCTACGCCGG 
GGCTACCGGT 



TTGCTCGACT 
GCAACAGAGC 
TGCTTTTCAT 
TATGCCCTAT 
AGTCGCAGGC 
GAGAACCGGG 
ACTGCCATGT 
CCGTACAAAC 
TAACCGGCAC 
GACAACCTGC 
CGGCGTGGAC 
TGAACGGACC 
GGTACATCCG 
CGGCCTGATC 
CCTATACGGG 
AACAACAAAT 
CGGGGCGGCA 
CAGATACCGA 
CTGCAGCTGG 
GAAAGTGGAC 
GCGGCAAGGG 
CTGAGTGCCG 
AACCGACGGC 
GCAGTGAAGG 
CGGACTGCTT 
CGTAGAACAG 
CCTCCGAATC 
CGCACAGATA 
GGCAGCCGTA 
GTCTCGCCGC 
CAGGGACGCC 
GGGTCTGCGC 
AGGGCGGTGT 
GCCGCGAAAA 
ACGCAGCACA 
GTCTGTTTGC 
GGCCTGTTCT 
TGCGGACGAA 
GCGCACTGGG 
GTCGAAGGCG 
AAAAGGCAGT 
TGGTCGGACT 
GTCCTGTTTG 
CACGGTAACG 
GGGCACGCAA 
GTCGAATTCG 
TTCCAAACAG 
TCCTCGAGCA 



ATTCCGGCGG 
GATTACGGCA 
CTTTTCGACA 
TGCCATTTTA 
GTAGACCGCA 
TACAGAACCG 
GGTGCCTGTC 
CCGATTCAAA 
GGCGGCTCTG 
GTACCACGTT 
AGCAAGTTCG 
CGCGTCCTTT 
ATATTGCCTA 
AAAAAAGGCG 
CAAAACCATT 
CGGATATGCG 
TCCGGCGGCA 
CCAATCCGGC 
ACGGCAAAGG 
GGTACGGCGA 
GGCAGGCTAT 
CCAAAATCGG 
GGCCTGCTGG 
CGACACGCTG 
CGGCAGCGGC 
GGCGGCAGCA 
ATCCGCAACA 
TGCCGGGCAT 
CAGCATGCGA 
TACCGTCTAT 
GCCTGAAAGC 
GTCATCGCGC 
TGAAGGCAAA 
CCGGCGAAAA 
TGGAGCGAAA 
AGGCATACGG 
CCTACGGACG 
CATGCGGAAG 
CGGTGTCAAC 
GTCTGCGCTA 
GCTTTGGGCT 
CGCGGGTCTG 
CAACGGCGGG 
GGCGGCTTTA 
TATGCCGCAC 
GCAACGGCTG 
TACGGCAACC 
CCACCACCAC 



TGATAAAACA 
ACCTGTCCTA 
GGCAATGACG 
TGAAAAAGAC 
GTGGAGAAAA 
CTTGAGTATG 
GGCACCCTAT 
TTGCCGGAAC 
CTGCTGCAGA 
GCTGACGACG 
GCTGGGGACT 
CCGTTCGGCG 
CTCCTTCCGT 
GCAGCCAACT 
ATCGAAGGCG 
CGTCGAAACC 
GCCTGAACAG 
GCAAACGAAA 
TACGCTGTAC 
TTATCGGCGG 
CTCAACAGTA 
GCAGGATTAT 
CTTCCCTCGA 
TCCTATTATG 
ACATTCCGCG 
ATCTGGAAAA 
CCCGAGACGG 
CCGCCCCTAC 
ATGCCGCCGA 
GCCGACAGTA 
CGTATCGGAC 
AAACCCAACA 
ATGCGCGGCA 
TACGACAGCA 
ACAGTGCAAA 
CACGATGCGG 
C T AC AAAAAC 
GCAGCGTCAA 
GTTCCGTTTG 
CGACCTGCTC 
GGAGCGGCAA 
AAGCTGTCGC 
CGTGGAACGC 
CCGGCGCGAC 
ACCCGTCTGG 
GAACGGCTTG 
ACAGCGGACG 
CACCACTGA 



GACGAGGGTA 
CCACATCCGT 
CACAAGCTCA 
GCTCAAAAAG 
GTTCAAACGG 
GCTCCAACCA 
GAAGCAAGCG 
ATCCTTTTCC 
AATACCCGTG 
GCTCAGGACA 
GCTGGATGCG 
ACTTTACCGC 
AACGACATTT 
GCAACTGCAC 
GTTCGCTGGT 
AAAGGTGCGC 
CGACGGCATT 
CCGTACACAT 
ACACGTTTGG 
CAAGCTGTAC 
CCGGACGACG 
TCTTTCTTCA 
CAGCGTCGAA 
TCCGTCGCGG 
CCCGCCGGTC 
CCTGATGGTC 
TTGAAACTGC 
GGCGCAACTT 
CGGTGTACGC 
CCGCCGCCCA 
GGGTTGGACC 
GGACGGTGGA 
GTACCCAAAC 
GCCGCCACAC 
TGCAAAAACC 
GCGATATCGG 
AG CAT C AG CC 
CGGCACGCTG 
CCGCAACGGG 
AAACAGGATG 
CAGCCTCACT 
AACCCTTGAG 
GACCTGAACG 
TGCAGCAACC 
TTGCCGGCCT 
GCACGTTACA 
AGTCGGCGTA 
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MVAADIGAGL 
YGNGDSLNTG 
TAFQTEQIQD 
FGSDDAGGKL 
VISGSVLYNQ 
EGSGGGGTSA 
CAGRDDVAVT 
GRGVEVGIVD 
GGKDIEASFD 
GGIAPDATLH 
RAGTADLFQI 
NKNMLFIFST 
EMYGEPGTEP 
APIVTGTAAL 
GKAMNGPASF 
GNNTYTGKTI 
VYLADTDQSG 
MSARGKGAGY 



ADALTAPLDH 
KLKNDKVSRF 
SEHSGKMVAK 
TYTIDFAAKQ 
AEKGSYSLGI 
PDFNAGGTGI 
DRDAKINAPP 
TGESVGSISF 
DEAVIETEAK 
IMNTNDETKN 
ANSEEQYRQA 
GNDAQAQPNT 
LEYGSNHCGI 
LLQKY PWMSN 
PFGDFTADTK 
IEGGSLVLYG 
ANETVHIKGS 
LNSTGRRVPF 



KDKGLQSLTL 
DFIRQIEVDG 
RQFRIGDIAG 
GNGKIEHLKS 
FGGKAQEVAG 
GSNSRATTAK 
PNLHTGDFPN 
PELYGRKEHG 
PTDIRHVKEI 
EMMVAAIRNA 
LLDYSGGDKT 
YALLPFYEKD 
TAMWCLSAPY 
DNLRTTLLTT 
GTSDIAYSFR 
NNKSDMRVET 
LQLDGKGTLY 
LSAAKIGQDY 



DQSVRKNEKL 
QLITLESGEF 
EHTSFDKLPE 
PELNVDLAAA 
SAEVKTVNGI 
SAAVSYAGIK 
PNDAYKNLIN 
YNENYKNYTA 
GHIDLVSHII 
WVKLGERGVR 
DEGIRLMQQS 
AQKGIITVAG 
EASVRFTRTN 
AQDIGAVGVD 
NDISGTGGLI 
KGALIYNGAA 
TRLGKLLKVD 
SFFTNIETDG 



KLAAQGAEKT 
QVYKQSHSAL 
GGRATYRGTA 
DIKPDGKRHA 
RHIGLAAKQL 
NEMCKDRSML 
LKPAIEAGYT 
YMRKEAPEDG 
GGRSVDGRPA 
IVNNSFGTTS 
DYGNLSYHIR 
VDRSGEKFKR 
PIQIAGTSFS 
SKFGWGLLDA 
KKGGSQLQLH 
SGGSLNSDGI 
GTAIIGGKLY 
GLLASLDSVE 



KTAGSEGDTL SYYVRRGNAA RTASAAAHSA PAGLKHAVEQ GGSNLENLMV 
ELDASESSAT PETVETAAAD RTDMPGIRPY GAT FRAAAAV QHANAADGVR 
IFTCSLAATVY ADSTAAHADM QGRRLKAVSD GLDHNGTGLR VIAQTQQDGG 
TWEQGGVEGK MRGSTQTVGI AAKTGENTTA AATLGMGRST WSENSANAKT 
DSISLFAGIR HDAGDIGYLK GLFSYGRYKN SISRSTGADE HAEGSVNGTL 
MQLGALGGVN VPFAATGDLT VEGGLRYDLL KQDAFAEKGS ALGWSGNSLT 
EGTLVGLAGL KLSQPLSDKA VLFATAGVER DLNGRDYTVT GGFTGATAAT 
GKTGARNMPH TRLVAGLGAD VEFGNGWNGL ARYSYAGSKQ YGNHSGRVGV 
GYRFLEHHHH HH* 
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AG741-ORF46.1 



10 



15 



20 



25 



30 



35 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 



ATGGTCGCCG 
GCTCGACCAT 
TCAGGAAAAA 
TATGGAAACG 
CAGCCGTTTC 
CCTTGGAGAG 
ACCGCCTTTC 
GGTTGCGAAA 
CTTTTGACAA 
TTCGGTTCAG 
CGCCAAGCAG 
ATGTCGACCT 
GTCATCAGCG 
CCTCGGTATC 
TGAAAACCGT 
GACGGTGGCG 
CCGGCAGGTT 
TATTCGGCAG 
G G AAAAAT AC 
CATTAAAGGA 
AAGTCCATTC 
GCCGGTAGTC 
ATACGAACAC 
ATCCCGCTCC 
GTTGCCCAAA 
ACGGCTTGCC 
TAGGCGACGG 
TCGGGCAATG 
CATCATCGGC 
GCATAAGCGA 
TCCACCGAAA 
ACTCAAAGAC 
CCAATGCCGC 
ATCCCCATCA 
CATCACGGCA 
CGAAAGGGAA 
AAATACCCGT 
GCGTTACGGC 
G C AAAAAT GT 
TTTGACGGTA 
GCTCGAGCAC 



CCGACATCGG 
AAAGACAAAG 
CGAGAAACTG 
GTGACAGCCT 
GACTTTATCC 
TGGAGAGTTC 
AGACCGAGCA 
CGCCAGTTCA 
GCTTCCCGAA 
ACGATGCCGG 
GGAAACGGCA 
GGCCGCCGCC 
GTTCCGTCCT 
TTTGGCGGAA 
AAACGGCATA 
GAGGCACTGG 
CTCGACCGTC 
CAGGGGGGAA 
AAAGCCATCA 
AATATCGGCT 
CCCCTTCGAC 
CCGTTGACGG 
CATCCCGCCG 
CAAAGGCGCG 
ATATCCGCCT 
GACCGTTTCC 
ATTCAAACGC 
CCGCCGAAGC 
GCGGCAGGAG 
AGGCTCAAAC 
ACAAGATGGC 
TATGCCGCAG 
ACAAGGCATA 
AAGGGATTGG 
CATCCTATCA 
ATCCGCCGTC 
CCCCTTACCA 
AAAGAAAACA 
CAAACTGGCA 
AAGGGTTTCC 
CACCACCACC 



TGCGGGGCTT 
GTTTGCAGTC 
AAGCTGGCGG 
CAATACGGGC 
GCCAAATCGA 
CAAGTATACA 
AATACAAGAT 
GAATCGGCGA 
GGCGGCAGGG 
CGGAAAACTG 
AAATCGAACA 
GATATCAAGC 
TTACAACCAA 
AAGCCCAGGA 
CGCCATATCG 
ATCCTCAGAT 
AGCATTTCGA 
CTTGCCGAGC 
GTTGGGCAAC 
ACATTGTCCG 
AACCATGCCT 
ATTTAGCCTT 
ACGGCTATGA 
AGGGATATAT 
CAACCTGACC 
ACAATGCCGG 
GCCACCCGAT 
CTTCAACGGC 
AAATTGTCGG 
ATTGCTGTCA 
GCGCATCAAC 
CAGCCATCCG 
GAAGCCGTCA 
AGCTGTTCGG 
AGCGGTCGCA 
AGCGACAATT 
TTCCCGAAAT 
TCACCTCCTC 
GACCAACGCC 
GAATTTTGAG 
ACCACTGA 



GCCGATGCAC 
TTTGACGCTG 
CACAAGGTGC 
AAATTGAAGA 
AGTGGACGGG 
AACAAAGCCA 
TCGGAGCATT 
CATAGCGGGC 
CGACATATCG 
ACCTACACCA 
TTTGAAATCG 
CGGATGGAAA 
GCCGAGAAAG 
AGTTGCCGGC 
GCCTTGCCGC 
TTGGCAAACG 
ACCCGACGGG 
GCAGCGGCCA 
CTGATGATTC 
CTTTTCCGAT 
CACATTCCGA 
TACCGCATCC 
CGGGCCACAG 
ACAGCTACGA 
GACAACCGCA 
TAGTATGCTG 
ACAGCCCCGA 
ACTGCAGATA 
CGCAGGCGAT 
TGCACGGCTT 
GATTTGGCAG 
CGATTGGGCA 
GCAATATCTT 
GGAAAATACG 
GATGGGCGCG 
TTGCCGATGC 
ATCCGTTCAA 
AACCGTGCCG 
ACCCGAAGAC 
AAGCACGTGA 



TAACCGCACC 
GATCAGTCCG 
GGAAAAAACT 
ACGACAAGGT 
CAGCTCATTA 
TTCCGCCTTA 
CCGGGAAGAT 
GAACATACAT 
CGGGACGGCG 
TAGATTTCGC 
CCAGAACTCA 
ACGCCATGCC 
GCAGTTACTC 
AGCGCGGAAG 
CAAGCAACTC 
ATTCTTTTAT 
AAATACCACC 
TATCGGATTG 
AACAGGCGGC 
CACGGGCACG 
TTCTGATGAA 
ATTGGGACGG 
GGCGGCGGCT 
CAT AAAAG G C 
GCACCGGACA 
ACGCAAGGAG 
GCTGGACAGA 
TCGTTAAAAA 
GCCGTGCAGG 
GGGTCTGCTT 
ATATGGCGCA 
GTCCAAAACC 
TATGGCAGCC 
GCTTGGGCGG 
ATCGCATTGC 
GGCATACGCC 
ACTTGGAGCA 
CCGTCAAACG 
AGGCGTACCG 
AATATGATAC 



40 



45 



50 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



MVAADIGAGL 
YGNGDSLNTG 
TAFQTEQIQD 
FGSDDAGGKL 
VISGSVLYNQ 
DGGGGTGSSD 
GKIQSHQLGN 
AGSPVDGFSL 
VAQNIRLNLT 
SGNAAEAFNG 
STENKMARIN 
IPIKGIGAVR 
KYPSPYHSRN 
FDGKGFPNFE 



ADALTAPLDH 
KLKNDKVSRF 
SEHSGKMVAK 
TYTIDFAAKQ 
AEKGSYSLGI 
LANDS FIRQV 
LMIQQAAIKG 
YRIHWDGYEH 
DNRSTGQRLA 
TADIVKNIIG 
DLADMAQLKD 
GKYGLGGITA 
IRSNLEQRYG 
KHVKYDTLEH 



KDKGLQSLTL 
DFIRQIEVDG 
RQFRIGDIAG 
GNGKIEHLKS 
FGGKAQEVAG 
LDRQHFEPDG 
NIGYIVRFSD 
HPADGYDGPQ 
DRFHNAGSML 
AAGEIVGAGD 
YAAAAIRDWA 
HPIKRSQMGA 
KENITSSTVP 
HHHHH* 



DQSVRKNEKL 
QLITLESGEF 
EHTSFDKLPE 
PELNVDLAAA 
SAEVKTVNGI 
KYHLFGSRGE 
HGHEVHSPFD 
GGGYPAPKGA 
TQGVGDGFKR 
AVQGISEGSN 
VQN PNAAQGI 
IALPKGKSAV 
PSNGKNVKLA 



KLAAQGAEKT 
QVYKQSHSAL 
GGRATYRGTA 
DIKPDGKRHA 
RHIGLAAKQL 
LAERSGHIGL 
NHASHSDSDE 
RDIYSYDIKG 
ATRYS PELDR 
IAVMHGLGLL 
EAVSNIFMAA 
SDN FADAAYA 
DQRHPKTGVP 



55 Example 16- C-terminaf fusions ('hybrids') with 287/AG287 

[0228] According to the invention, hybrids of two proteins A & B may be either NH 2 -A-B-COOH or NH 2 -B-A-COOH. 
The effect of this difference was investigated using protein 287 either C-terminal (in '287-His' form) or N-terminal (in 
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AG287 form - sequences shown above) to 91 9, 953 and ORF46.1 . A panel of strains was used, including homologous 
strain 2996. FCA was used as adjuvant: 





287 & 919 


287 & 953 


287 & ORF46.1 


Strain 


AG287-919 


919-287 


AG287-953 


953-287 


AG287-46. 1 


46.1-287 


2996 


128000 


16000 


65536 


8192 


16384 


8192 


BZ232 


256 


128 


128 


<4 


<4 


<4 


1000 


2048 


<4 


<4 


<4 


<4 


<4 


MC58 


8192 


1024 


16384 


1024 


512 


128 


NGH38 


32000 


2048 


>2048 


4096 


16384 


4096 


394/98 


4096 


32 


256 


128 


128 


16 


Men A (F6124) 


32000 


2048 


>2048 


32 


8192 


1024 


MenC(BZ133) 


64000 


>8192 


>8192 


<16 


8192 


2048 



Better bactericidal titres are generally seen with 287 at the N-terminus (in the AG form) 

[0229] When fused to protein 961 [NH 2 -AG287-961 -COOH - sequence shown above], the resulting protein is insoluble 
and must be denatured and renatured for purification. Following renaturation, around 50% of the protein was found to 
remain insoluble. The soluble and insoluble proteins were compared, and much better bactericidal titres were obtained 
with the soluble protein (FCA as adjuvant): 





2996 


BZ232 


MC58 


NGH38 


F6124 


BZ133 


Soluble 


65536 


128 


4096 


>2048 


>2048 


4096 


Insoluble 


8192 


<4 


<4 


16 


n.d. 


n.d. 



[0230] Titres with the insoluble form were, however, improved by using alum adjuvant instead: 



Insoluble 


32768 


1128 


4096 


>2048 


>2048 


2048 



Example 17 — N-terminal fusions ('hybrids') to 287 

[0231] Expression of protein 287 as full-length with a C-terminal His-tag, or without its leader peptide but with a C- 
terminal His-tag, gives fairly low expression levels. Better expression is achieved using a N-terminal GST-fusion. 
[0232] As an alternative to using GST as an N-terminal fusion partner, 287 was placed at the C-terminus of protein 
919 ('919-287'), of protein 953 ('953-287'), and of proteins ORF46.1 ('ORF46. 1-287'). In both cases, the leader peptides 
were deleted, and the hybrids were direct in-frame fusions. 

[0233] To generate the 953-287 hybrid, the leader peptides of the two proteins were omitted by designing the forward 
primer downstream from the leader of each sequence; the stop codon sequence was omitted in the 953 reverse primer 
but included in the 287 reverse primer. For the 953 gene, the 5' and the 3' primers used for amplification included a 
A/del and a Bamh\ restriction sites respectively, whereas for the amplification of the 287 gene the 5' and the 3' primers 
included a BamH\ and a Xho\ restriction sites respectively. In this way a sequential directional cloning of the two genes 
in pET21b+, using Nde\-BamH\ (to clone the first gene) and subsequently BamH\-Xho\ (to clone the second gene) could 
be achieved. 

[0234] The 91 9-287 hybrid was obtained by cloning the sequence coding for the mature portion of 287 into the Xho\ 
site at the 3'-end of the 91 9-His clone in pET21 b+. The primers used for amplification of the 287 gene were designed 
for introducing a Sa/I restriction site at the 5'-and a Xho\ site at the 3'- of the PCR fragment. Since the cohesive ends 
produced by the Sa/I and Xho\ restriction enzymes are compatible, the 287 PCR product digested with Sal\-Xho\ could 
be inserted in the pET21 b-919 clone cleaved with Xho\. 
[0235] The ORF46.1 -287 hybrid was obtained similarly. 

[0236] The bactericidal efficacy (homologous strain) of antibodies raised against the hybrid proteins was compared 
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with antibodies raised against simple mixtures of the component antigens: 





Mixture with 287 


Hybrid with 287 


919 


32000 


16000 


953 


8192 


8192 


ORF46.1 


128 


8192 



[0237] Data for bactericidal activity against heterologous MenB strains and against serotypes A and C were also 
obtained for 919-287 and 953-287: 





919 


953 


ORF46.1 


Strain 


Mixture 


Hybrid 


Mixture 


Hybrid 


Mixture 


Hybrid 


MC58 


512 


1024 


512 


1024 




1024 


NGH38 


1024 


2048 


2048 


4096 




4096 


BZ232 


512 


128 


1024 


16 






Men A (F6124) 


512 


2048 


2048 


32 




1024 


MenC (C11) 


>2048 


n.d. 


>2048 


n.d. 




n.d. 


MenC(BZ133) 


>4096 


>8192 


>4096 


<16 




2048 



[0238] Hybrids of ORF46.1 and 91 9 were also constructed. Best results (four-fold higher titre) were achieved with 91 9 
at the N-terminus. 

[0239] Hybrids 91 9-51 9His, ORF97-225His and 225-ORF97His were also tested. These gave moderate ELISA fitres 
and bactericidal antibody responses. 

Example 18- the leader peptide from ORF4 

[0240] As shown above, the leader peptide of ORF4 can be fused to the mature sequence of other proteins (e.g. 
proteins 287 and 919). It is able to direct lipidation in E.coii. 

Example 19- domains in 564 

[0241] The protein '564' is very large (2073aa), and it is difficult to clone and express it in complete form. To facilitate 
expression, the protein has been divided into four domains, as shown in figure 8 (according to the MC58 sequence): 



Domain 


A 


B 


C 


D 


Amino Acids 


79-360 


361-731 


732-2044 


2045-2073 



[0242] These domains show the following homologies: 

• Domain A shows homology to other bacterial toxins: 

gblAAG03431 .1 IAE004443_9probable hemagglutinin [Pseudomonas aeruginosa] (38%) 
gblAAC31981.1l (139897) HecA [Pectobacterium chrysanthemi] (45%) 
emblCAA36409.1l (X52156) filamentous hemagglutinin [Bordetella pertussis] (31%) 
gblAAC79757.1l (AF057695) large supernatant proteinl [Haemophilus ducreyi] (26%) 
gblAAA25657.1l (M30186) HpmA precursor [Proteus mirabilis] (29%) 

• Domain B shows no homology, and is specific to 564. 

• Domain C shows homology to: 



51 



EP 1 790 660 A2 



gblAAF84995.1 IAE004032 HA-like secreted protein [Xylella fastidiosa] (33%) 
gblAAG05850.1IAE004673 hypothetical protein [Pseudomonas aeruginosa] (27%) 
gblAAF68414.1 AF237928 putative FHA [Pasteurella multocisida] (23%) 
gblAAC79757.1 l(AF057695) large supernatant proteinl [Haemophilus ducreyi] (23%) 
pirllS21010 FHA B precursor [Bordetella pertussis] (20%) 

• Domain D shows homology to other bacterial toxins: 

gblAAF84995.1IAE004032_14 HA-like secreted protein [Xylella fastidiosa] (29%) 

[0243] Using the MC58 strain sequence, good intracellular expression of 564ab was obtained in the form of GST- 
fusions (no purification) and his-tagged protein; this domain-pair was also expressed as a lipoprotein, which showed 
moderate expression in the outer membrane/supernatant fraction. 

[0244] The b domain showed moderate intracellular expression when expressed as a his-tagged product (no purifi- 
cation), and good expression as a GST-fusion. 

[0245] The c domain showed good intracellular expression as a GST-fusion, but was insoluble. The d domain showed 
moderate intracellular expression as a his-tagged product (no purification). The cd protein domain-pair showed moderate 
intracellular expression (no purification) as a GST-fusion. 

[0246] Good bactericidal assay titres were observed using the c domain and the be pair. 
Example 20 - the 919 leader peptide 

[0247] The 20mer leader peptide from 919 is discussed in example 1 above: 
MKKYLFRAAL YGIAAAILAA 

[0248] As shown in example 1 , deletion of this leader improves heterologous expression, as does substitution with 
the ORF4 leader peptide. The influence of the 91 9 leader on expression was investigated by fusing the coding sequence 
to the PhoC reporter gene from Morganella morganii [Thaller et al. (1994) Microbiology 140:1341-1350]. The construct 
was cloned in the pET21-b plasmid between the Nde\ and Xho\ sites (Figure 9): 



1 MKKYLFRAAL YGIAAAILAA AIPAGNDATT KPDLYYLKNE QAIDSLKLLP 

51 PPPEVGSIQF LNDQAMYEKG RMLRNTERGK QAQADADLAA GGVATAFSGA 

101 FGYPITEKDS PELYKLLTNM IEDAGDLATR SAKEHYMRIR PFAFYGTETC 

151 NTKDQKKLST NGSYPSGHTS IGWATALVLA EVNPANQDAI LERGYQLGQS 

201 RVICGYHWQS DVDAARIVGS AAVATLHSDP AFQAQLAKAK QEFAQKSQK* 

[0249] The level of expression of PhoC from this plasmid is >200-fold lower than that found for the same construct 
but containing the native PhoC signal peptide. The same result was obtained even after substitution of the T7 promoter 
with the E.coli Plac promoter. This means that the influence of the 919 leader sequence on expression does not depend 
on the promoter used. 

[0250] In order to investigate if the results observed were due to some peculiarity of the 91 9 signal peptide nucleotide 
sequence (secondary structure formation, sensitivity to RNAases, etc.) or to protein instability induced by the presence 
of this signal peptide, a number of mutants were generated. The approach used was a substitution of nucleotides of the 
919 signal peptide sequence by cloning synthetic linkers containing degenerate codons. In this way, mutants were 
obtained with nucleotide and/or amino acid substitutions. 

[0251 ] Two different linkers were used, designed to produce mutations in two different regions of the 91 9 signal peptide 
sequence, in the first 19 base pairs (L1) and between bases 20-36 (S1). 

LI: 5' T ATG AAa/g TAc/t c/tTN TTt/c a/cGC GCC GCC CTG TAC GGC ATC GCC GCC 
GCC ATC CTC GCC GCC GCG ATC CC 3' 

SI: 5' T ATG AAA AAA TAC CTA TTC CGa/g GCN GCN c/tTa/g TAc/t GGc/g ATC GCC 
GCC GCC ATC CTC GCC GCC GCG ATC CC 3' 

[0252] The alignment of some of the mutants obtained is given below. 
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LI mutants : 

9Ll-a ATGAAGAAGTACCTTTTCAGCGCCGCC — — 

9Ll-e ATGAAAAAATACTTTTTCCGCGCCGCC— — ~ — 

9Ll-d ATGAAAAAATACTTTTTCCGCGCCGCC ~ — 

9Ll-f ATGAAAAAATATCTCTTTAGCGCCGCCCTGTACGGCATCGCCGCCGCCATCCTCGCCGCC 

919sp ATGAAAAAATACCTATTCCGCGCCGCCCTGTACGGCATCGCCGCCGCCATCCTCGCCGCC 

9Lla MKKYLFSAA-- — — 

9Lle MKKYFFRAA ~ 

9Lld MKKYFFRAA 

9Llf MKKYLFSAALYGIAAAILAA 

919sp MKKYLFRAALYGIAAAILAA (i.e. native signal peptide) 



SI mutants: 

9Sl-e ATGAAAAAATACCTATTC ATCGCCGCCGCCATCCTCGCCGCC 

9S1-C ATGAAAAAATACCTATTCCGAGCTGCCCAATACGGCATCGCCGCCGCCATCCTCGCCGCC 
9Sl-b ATGAAAAAATACCTATTCCGGGCCGCCCAATACGGCATCGCCGCCGCCATCCTCGCCGCC 
9Sl-i ATGAAAAAATACCTATTCCGGGCGGCTTTGTACGGGATCGCCGCCGCCATCCTCGCCGCC 
919sp ATGAAAAAATACCTATTCCGCGCCGCCCTGTACGGCATCGCCGCCGCCATCCTCGCCGCC 

MKKYLF IAAAILAA 

MKKYLFRAAQYGIAAAILAA 
MKKYLFRAAQ YGIAAAI LAA 
MKKYLFRAALYGIAAAILAA 
MKKYLFRAAL YGIAAAI LAA 
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9Sle 
9Slc 
9Slb 
9Sli 
919sp 



[0253] As shown in the sequences alignments, most of the mutants analysed contain in-frame deletions which were 
unexpectedly produced by the host cells. 
30 [0254] Selection of the mutants was performed by transforming E. coli BL21 (DE3) cells with DNA prepared from a 
mixture of L1 and S1 mutated clones. Single transformants were screened for high PhoC activity by streaking them onto 
LB plates containing 100 jxg/ml ampicillin, 50|xg/ml methyl green, 1 mg/ml PDP (phenolphthaleindiphosphate). On this 
medium PhoC-producing cells become green (Figure 10). 

[0255] A quantitative analysis of PhoC produced by these mutants was carried out in liquid medium using pNPP as 
35 a substrate for PhoC activity. The specific activities measured in cell extracts and supernatants of mutants grown in 
liquid medium for 0, 30, 90, 180 min. were: 

CELL EXTRACTS 

40 [0256] 





0 


30 


90 


180 


control 


0,00 


0,00 


0,00 


0,00 


9phoC 


1,11 


1,11 


3,33 


4,44 


9S1e 


102,12 


111,00 


149,85 


172,05 


9L1a 


206,46 


111,00 


94,35 


83,25 


9L1d 


5,11 


4,77 


4,00 


3,11 


9L1f 


27,75 


94,35 


82,14 


36,63 


9S1b 


156,51 


111,00 


72,15 


28,86 


9S1c 


72,15 


33,30 


21,09 


14,43 


9S1i 


156,51 


83,25 


55,50 


26,64 


phoCwt 


194,25 


180,93 


149,85 


142,08 
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SUPERNATANTS 
[0257] 



10 



20 





0 


30 


90 


1801 


control 


0,00 


0,00 


0,00 


0,00 


9phoC 


0,33 


0,00 


0,00 


0,00 


9S1e 


0,11 


0,22 


0,44 


0,89 


9L1a 


4,88 


5,99 


5,99 


7,22 


9L1d 


0,11 


0,11 


0,11 


0,11 


9L1f 


0,11 


0,22 


0,11 


0,11 


9S1b 


1,44 


1,44 


1,44 


1,67 


9S1c 


0,44 


0,78 


0,56 


0,67 


9S1i 


0,22 


0,44 


0,22 


0,78 


phoCwt 


34,41 


43,29 


87,69 


177,60 



[0258] Some of the mutants produce high amounts of PhoC and in particular, mutant 9L1a can secrete PhoC in the 
culture medium. This is noteworthy since the signal peptide sequence of this mutant is only 9 amino acids long. This is 
the shortest signal peptide described to date. 

25 

Example 21 — C-terminal deletions of Maf -related proteins 

[0259] MafB-related proteins include 730, ORF46 and ORF29. 
[0260] The 730 protein from MC58 has the following sequence: 

30 



1 VKPLRRLTNL LAACAVAAAA LIQPALA ADL AQDPFITDNA QRQHYEPGGK 

51 YHLFGDPRGS VSDRTGKINV IQDYTHQMGN LLIQQANING TIGYHTRFSG 

101 HGHEEHAPFD NHAADSASEE KGNVDEGFTV YRLNWEGHEH HPADAYDGPK 

151 GGNYPKPTGA RDEYTYHVNG TARSIKLNPT DTRSIRQRIS DNYSNLGSNF 

201 SDRADEANRK MFEHNAKLDR WGNSMEFING VAAGALNPFI SAGEALGIGD 

251 ILYGTRYAID KAAMRNIAPL PAEGKFAVIG GLGSVAGFEK NTREAVDRWI 

301 QENPNAAETV EAVFNVAAAA KVAKLAKAAK PGKAAVSGDF ADSYKKKLAL 



351 SDSARQLYQN AKYREALDIH YEDLIRRKTD GSSKFINGRE IDAVTNDALI 
401 QAKRTISAID KPKNFLNQKN RKQIKATIEA ANQQGKRAEF WFKYGVHSQV 
4 51 KSYIESKGGI VKTGLGD* 



[0261] The leader peptide is underlined. 

[0262] 730 shows similar features to ORF46 (see example 8 above): 

as for Orf46, the conservation of the 730 sequence among MenB, MenA and gonococcus is high (>80%) only for 
the N-terminal portion. The C-terminus, from -340, is highly divergent. 

its predicted secondary structure contains a hydrophobic segment spanning the central region of the molecule (aa. 
227-247). 

expression of the full-length gene in E. coli gives very low yields of protein. Expression from tagged or untagged 
constructs where the signal peptide sequence has been omitted has a toxic effect on the host cells. In other words, 
the presence of the full-length mature protein in the cytoplasm is highly toxic for the host cell while its translocation 
to the periplasm (mediated by the signal peptide) has no detectable effect on cell viability. This "intracellular toxicity" 
of 730 is particularly high since clones for expression of the leaderless 730 can only be obtained at very lowfrequency 
using a recA genetic background (E. coli strains: HB101 for cloning; HMS1 74(DE3) for expression). 
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[0263] To overcome this toxicity, a similar approach was used for 730 as described in example 8 for ORF46. Four C- 
terminal truncated forms were obtained, each of which is well expressed. All were obtained from intracellular expression 
of His-tagged leaderless 730. 

[0264] Form A consists of the N-terminal hydrophilic region of the mature protein (aa. 28-226). This was purified as 
a soluble His-tagged product, having a higher-than-expected MW. 

[0265] Form B extends to the end of the region conserved between serogroups (aa. 28-340). This was purified as an 
insoluble His-tagged product. 

[0266] The C-terminal truncated forms named C1 and C2 were obtained after screening for clones expressing high 
levels of 730-His clones in strain HMS1 74(DE3). Briefly, the pET21 b plasmid containing the His-tagged sequence coding 
for the full-length mature 730 protein was used to transform the recA strain HMS 1 74(DE3). Transformants were obtained 
at low frequency which showed two phenotypes: large colonies and very small colonies. Several large and small colonies 
were analysed for expression of the 730-His clone. Only cells from large colonies over-expressed a protein recognised 
by anti-730A antibodies. However the protein over-expressed in different clones showed differences in molecular mass. 
Sequencing of two of the clones revealed that in both cases integration of an E, coli IS sequence had occurred within 
the sequence coding for the C terminal region of 730. The two integration events have produced in-frame fusion with 1 
additional codon in the case of C1 , and 1 2 additional codons in the case of C2 (Figure 11). The resulting "mutant" forms 
of 730 have the following sequences: 



730-Cl (due to an IS1 insertion 

1 MADLAQDPFI TDNAQRQHYE 

51 QMGNLLIQQA NINGTIGYHT 

101 GFTVYRLNWE GHEHHPADAY 

151 LNPTDTRSIR QRISDNYSNL 

201 FINGVAAGAL NPFISAGEAL 

251 AVIGGLGSVA GFEKNTREAV 

301 KAAKPGKAAV SGDFADSYKK 

351 RKTDGSSKFI NGREIDAVTN 



- figure 11A) 

PGGKYHLFGD PRGSVSDRTG KINVIQDYTH 
RFSGHGHEEH APFDNHAADS ASEEKGNVDE 
DGPKGGNYPK PTGARDEYTY HVNGTARSIK 
GSNFSDRADE ANRKMFEHNA KLDRWGNSME 
GIGDILYGTR YAI DKAAMRN IAPLPAEGKF 
DRWIQENPNA AETVEAVFNV AAAAKVAKLA 
KLALSDSARQ LYQNAKYREA LDIHYEDLIR 
DALIQAR* 



[0267] The additional amino acid produced by the insertion is underlined. 



730-C2 (due to an IS5 insertion 


1 


MADLAQDPFI 


TDNAQRQHYE 


51 


QMGNLLIQQA 


NINGTIGYHT 


101 


GFTVYRLNWE 


GHEHHPADAY 


151 


LNPTDTRSIR 


QRISDNYSNL 


201 


FINGVAAGAL 


NPFISAGEAL 


251 


AVIGGLGSVA 


GFEKNTREAV 


301 


KAAKPGKAAV 


SGDFADSYKK 


351 


LLG* 





- Figure 11B) 

PGGKYHLFGD PRGSVSDRTG KINVIQDYTH 
RFSGHGHEEH APFDNHAADS ASEEKGNVDE 
DGPKGGNYPK PTGARDEYTY HVNGTARSIK 
GSNFSDRADE ANRKMFEHNA KLDRWGNSME 
GIGDILYGTR YAI DKAAMRN IAPLPAEGKF 
DRWIQENPNA AETVEAVFNV AAAAKVAKLA 
KLALSDSARQ LYQNAKYREA L GKVRISGEI 



[0268] The additional amino acids produced by the insertion are underlined. 

[0269] In conclusion, intracellular expression of the 730-C1 form gives very high level of protein and has no toxic effect 
on the host cells, whereas the presence of the native C-terminus is toxic. These data suggest that the "intracellular 
toxicity" of 730 is associated with the C-terminal 65 amino acids of the protein. 

[0270] Equivalent truncation of ORF29 to the first 231 or 368 amino acids has been performed, using expression with 
or without the leader peptide (amino acids 1-26; deletion gives cytoplasmic expression) and with or without a His-tag. 

Example 22 - domains in 961 

[0271] As described in example 9 above, the GST-fusion of 961 was the best-expressed in E.coli. To improve ex- 
pression, the protein was divided into domains (figure 12). 

[0272] The domains of 961 were designed on the basis of YadA (an adhesin produced by Yersinia which has been 
demonstrated to be an adhesin localized on the bacterial surface that forms oligomers that generate surface projection 
[Hoiczyk et al. (2000) EMBO J 19:5989-99]) and are: leader peptide, head domain, coiled-coil region (stalk), and mem- 
brane anchor domain. 

[0273] These domains were expressed with or without the leader peptide, and optionally fused either to C-terminal 
His-tag or to N-terminal GST. E.coli clones expressing different domains of 961 were analyzed by SDS-PAGE and 
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western blot for the production and localization of the expressed protein, from over-night (o/n) culture or after 3 hours 
induction with IPTG. The results were: 





Total lysate 
(Western 
Blot) 


Periplasm 
(Western Blot) 


Supernatant 
(Western Blot) 


OMV 
SDS-PAGE 


961 (o/n) 
961 (IPTG) 


+/- 


- 


- 




961 -L (o/n) 
961 -L (IPTG) 


+ 
+ 






+ 
+ 


961 c-L (o/n) 
961 c-L (IPTG) 


+ 


+ 


+ 




961A r L (o/n) 
961A r L (IPTG) 


+ 






+ 



[0274] The results show that in E. coir. 

■ 961 -L is highly expressed and localized on the outer membrane. By western blot analysis two specific bands 
have been detected: one at ~45kDa (the predicted molecular weight) and one at ~180kDa, indicating that 961 -L 
can form oligomers. Additionally, these aggregates are more expressed in the over-night culture (without IPTG 
induction). OMV preparations of this clone were used to immunize mice and serum was obtained. Using overnight 
culture (predominantly by oligomeric form) the serum was bactericidal; the IPTG-induced culture (predominantly 
monomeric) was not bactericidal. 

■ 961A-,-L (with a partial deletion in the anchor region) is highly expressed and localized on the outer membrane, 
but does not form oligomers; 

■ the 961 c-L (without the anchor region) is produced in soluble form and exported in the supernatant. 
[0275] Titres in ELISA and in the serum bactericidal assay using His-fusions were as follows: 





ELISA 


Bactericidal 


961a (aa 24-268) 


24397 


4096 


961 b(aa 269-405) 


7763 


64 


961 c-L 


29770 


8192 


961c (2996) 


30774 


>65536 


961c (MC58) 


33437 


16384 


961 d 


26069 


>65536 



[0276] E.coli clones expressing different forms of 961 (961 , 961 -L, 961A-|-L and 961 c-L) were used to investigate if 
the 961 is an adhesin (c.f. YadA). An adhesion assay was performed using (a) the human epithelial cells and (b) E.coli 
clones after either over-night culture or three hours IPTG induction. 961 -L grown over-night (961 A-|-L) and IPTG-induced 
961 c-L (the clones expressing protein on surface) adhere to human epithelial cells. 

[0277] 961 c was also used in hybrid proteins (see above). As 961 and its domain variants direct efficient expression, 
they are ideally suited as the N-terminal portion of a hybrid protein. 

Example 23 — further hybrids 

[0278] Further hybrid proteins of the invention are shown below (see also Figure 14). These are advantageous when 
compared to the individual proteins: 
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ORF46. 1-741 



10 



15 



20 



25 



30 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 



ATGTCAGATT 
GCATTTCGAA 
TTGCCGAGCG 
TTGGGCAACC 
CATTGTCCGC 
ACCATGCCTC 
TTTAGCCTTT 
CGGCTATGAC 
GGGATATATA 
AACCTGACCG 
CAATGCCGGT 
CCACCCGATA 
TTCAACGGCA 
AATTGTCGGC 
TTGCTGTCAT 
CGCATCAACG 
AGCCATCCGC 
AAGCCGTCAG 
GCTGTTCGGG 
GCGGTCGCAG 
GCGACAATTT 
TCCCGAAATA 
CACCTCCTCA 
ACCAACGCCA 
AATTTTGAGA 
CGCCGCCGAC 
ACCATAAAGA 
AAAAACGAGA 
AAACGGTGAC 
GTTTCGACTT 
GAGAGTGGAG 
CTTTCAGACC 
CGAAACGCCA 



TGGCAAACGA 
CCCGACGGGA 
CAGCGGCCAT 
TGATGATTCA 
TTTTCCGATC 
ACATTCCGAT 
ACCGCATCCA 
GGGCCACAGG 
CAGCTACGAC 
ACAACCGCAG 
AGTATGCTGA 
CAGCCCCGAG 
CTGCAGATAT 
GCAGGCGATG 
GCACGGCTTG 
ATTTGGCAGA 
GATTGGGCAG 
CAATATCTTT 
GAAAATACGG 
ATGGGCGCGA 
TGCCGATGCG 
TCCGTTCAAA 
ACCGTGCCGC 
CCCGAAGACA 
AGCACGTGAA 
ATCGGTGCGG 
CAAAGGTTTG 
AACTGAAGCT 
AGCCTCAATA 
TATCCGCCAA 
AGTTCCAAGT 
GAG C AAAT AC 
GTTCAGAATC 



TTCTTTTATC 
AATACCACCT 
ATCGGATTGG 
ACAGGCGGCC 
ACGGGCACGA 
TCTGATGAAG 
TTGGGACGGA 
GCGGCGGCTA 
ATAAAAGGCG 
CACCGGACAA 
CGCAAGGAGT 
CTGGACAGAT 
CGTTAAAAAC 
CCGTGCAGGG 
GGTCTGCTTT 
TATGGCGCAA 
TCCAAAACCC 
ATGGCAGCCA 
CTTGGGCGGC 
TCGCATTGCC 
GCATACGCCA 
CTTGGAGCAG 
CGTCAAACGG 
GGCGTACCGT 
AT AT GAT AC G 
GGCTTGCCGA 
CAGTCTTTGA 
GGCGGCACAA 
CGGGCAAATT 
ATCGAAGTGG 
ATACAAACAA 
AAGATTCGGA 
GGCGACATAG 



CGGCAGGTTC 
ATTCGGCAGC 
GAAAAATACA 
ATTAAAGGAA 
AGTCCATTCC 
CCGGTAGTCC 
TACGAACACC 
TCCCGCTCCC 
TTGCCCAAAA 
CGGCTTGCCG 
AGGCGACGGA 
CGGGCAATGC 
ATCATCGGCG 
CATAAGCGAA 
CCACCGAAAA 
CTCAAAGACT 
CAATGCCGCA 
TCCCCATCAA 
ATCACGGCAC 
GAAAGGGAAA 
AATACCCGTC 
CGTTACGGCA 
CAAAAATGTC 
TTGACGGTAA 
GGATCCGGAG 
TGCACTAACC 
CGCTGGATCA 
GGTGCGGAAA 
GAAGAACGAC 
ACGGGCAGCT 
AGCCATTCCG 
GCATTCCGGG 
CGGGCGAACA 



TCGACCGTCA 
AGGGGGGAAC 
AAGCCATCAG 
ATATCGGCTA 
CCCTTCGACA 
CGTTGACGGA 
ATCCCGCCGA 
AAAGGCGCGA 
TATCCGCCTC 
ACCGTTTCCA 
TTCAAACGCG 
CGCCGAAGCC 
CGGCAGGAGA 
GGCTCAAACA 
CAAGATGGCG 
ATGCCGCAGC 
CAAGGCATAG 
AGGGATTGGA 
ATCCTATCAA 
TCCGCCGTCA 
CCCTTACCAT 
AAGAAAACAT 
AAACTGGCAG 
AGGGTTTCCG 
GGGGTGGTGT 
GCACCGCTCG 
GTCCGTCAGG 
AAACTTATGG 
AAGGTCAGCC 
CATTACCTTG 
CCTTAACCGC 
AAGATGGTTG 
TACATCTTTT 



35 



40 



1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 



GACAAGCTTC 
TTCAGACGAT 
AGCAGGGAAA 
GACCTGGCCG 
CAGCGGTTCC 
GTATCTTTGG 
ACCGTAAACG 
CCACCACCAC 



CCGAAGGCGG 
GCCGGCGGAA 
CGGCAAAATC 
CCGCCGATAT 
GTCCTTTACA 
CGGAAAAGCC 
GCATACGCCA 
CACCACTGA 



CAGGGCGACA 
AACTGACCTA 
GAACATTTGA 
CAAGCCGGAT 
ACCAAGCCGA 
CAGGAAGTTG 
TATCGGCCTT 



TATCGCGGGA 
CACCATAGAT 
AATCGCCAGA 
GGAAAACGCC 
GAAAGGCAGT 
CCGGCAGCGC 
GCCGCCAAGC 



CGGCGTTCGG 
TTCGCCGCCA 
ACTCAATGTC 
ATGCCGTCAT 
TACTCCCTCG 
GGAAGTGAAA 
AACTCGAGCA 



45 



50 



55 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 



MSDLANDSFI 
LGNLMIQQAA 
FSLYRIHWDG 
NLTDNRSTGQ 
FNGTADIVKN 
RINDLADMAQ 
AVRGKYGLGG 
SRNIRSNLEQ 
NFEKHVKYDT 
KNEKLKLAAQ 
ESGEFQVYKQ 
DKLPEGGRAT 
DLAAADIKPD 
TVNGIRHIGL 



RQVLDRQHFE 
IKGNIGYIVR 
YEHHPADGYD 
RLADRFHNAG 
IIGAAGEIVG 
LKDYAAAAIR 
ITAHPIKRSQ 
RYGKENITSS 
GSGGGGVAAD 
GAEKTYGNGD 
SHSALTAFQT 
YRGTAFGSDD 
GKRHAVISGS 
AAKQLEHHHH 



PDGKYHLFGS 
FSDHGHEVHS 
GPQGGGYPAP 
SMLTQGVGDG 
AGDAVQGISE 
DWAVQNPNAA 
MGAIALPKGK 
TVPPSNGKNV 
I GAG LAD ALT 
SLNTGKLKND 
EQIQDSEHSG 
AGGKLTYTID 
VLYNQAEKGS 
HH* 



RGELAERSGH 
PFDNHASHSD 
KGARDIYSYD 
FKRATRYSPE 
GSNIAVMHGL 
QGIEAVSNIF 
SAVSDNFADA 
KLADQRHPKT 
APLDHKDKGL 
KVSRFDFIRQ 
KMVAKRQFRI 
FAAKQGNGKI 
YSLGIFGGKA 



IGLGKIQSHQ 
SDEAGSPVDG 
IKGVAQNIRL 
LDRSGNAAEA 
GLLSTENKMA 
MAAIPIKGIG 
AYAKYPSPYH 
GVPFDGKGFP 
QSLTLDQSVR 
IEVDGQLITL 
GDIAGEHTSF 
EHLKSPELNV 
QEVAGSAEVK 
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ORF46. 1-961 



10 



15 



20 



25 



30 



35 



1 

51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 



ATGTCAGATT 
GCATTTCGAA 
TTGCCGAGCG 
TTGGGCAACC 
CATTGTCCGC 
ACCATGCCTC 
TTTAGCCTTT 
CGGCTATGAC 
GGGATATATA 
AACCTGACCG 
CAATGCCGGT 
CCACCCGATA 
TTCAACGGCA 
AATTGTCGGC 
TTGCTGTCAT 
CGCATCAACG 
AGCCATCCGC 
AAGCCGTCAG 
GCTGTTCGGG 
GCGGTCGCAG 
GCGACAATTT 
TCCCGAAATA 
CACCTCCTCA 
ACCAACGCCA 
AATTTTGAGA 
CACAAACGAC 
CCTACAACAA 
TACGACATTG 
CGATGTTGAA 
CTAACCTGAC 
GTAAAAGCTG 
CACTGATGCC 
ACGCCTTGAA 
AAGACAAATA 
CGTCGACAAG 
AAACCAACAC 
CAGACGGCCG 
AGAAACTGCA 
CAGCCGACAA 
GATATCGCTA 
CGTGTACACC 



TGGCAAACGA 
CCCGACGGGA 
CAGCGGCCAT 
TGATGATTCA 
TTTTCCGATC 
ACATTCCGAT 
ACCGCATCCA 
GGGCCACAGG 
CAGCTACGAC 
ACAACCGCAG 
AGTATGCTGA 
CAGCCCCGAG 
CTGCAGATAT 
GCAGGCGATG 
GCACGGCTTG 
ATTTGGCAGA 
GATTGGGCAG 
CAATATCTTT 
GAAAATACGG 
ATGGGCGCGA 
TGCCGATGCG 
TCCGTTCAAA 
ACCGTGCCGC 
CCCGAAGACA 
AGCACGTGAA 
GACGATGTTA 
TGGCCAAGAA 
ATGAAGACGG 
GCCGACGACT 
CAAAACCGTC 
CAGAATCTGA 
GCTTTAGCAG 
TAAATTGGGA 
TCGTAAAAAT 
CATGCCGAAG 
TAAGGCAGAC 
AAGAAACCAA 
GCAGGCAAAG 
GGCCGAAGCT 
CGAACAAAGA 
AGAGAAGAGT 



TTCTTTTATC 
AATACCACCT 
ATCGGATTGG 
ACAGGCGGCC 
ACGGGCACGA 
TCTGATGAAG 
TTGGGACGGA 
GCGGCGGCTA 
ATAAAAGGCG 
CACCGGACAA 
CGCAAGGAGT 
CTGGACAGAT 
CGTTAAAAAC 
CCGTGCAGGG 
GGTCTGCTTT 
TATGGCGCAA 
TCCAAAACCC 
ATGGCAGCCA 
CTTGGGCGGC 
TCGCATTGCC 
GCATACGCCA 
CTTGGAGCAG 
CGTCAAACGG 
GGCGTACCGT 
ATATGATACG 
AAAAAGCTGC 
ATCAACGGTT 
CACAATTACC 
TTAAAGGTCT 
AATGAAAACA 
AATAGAAAAG 
ATACTGATGC 
GAAAATATAA 
TGATGAAAAA 
CAT T C AAC G A 
GAAGCCGTCA 
ACAAAACGTC 
CCGAAGCTGC 
GTCGCTGCAA 
TAATATTGCT 
CTGACAGCAA 



CGGCAGGTTC 
ATTCGGCAGC 
GAAAAATACA 
ATTAAAGGAA 
AGTCCATTCC 
CCGGTAGTCC 
TACGAACACC 
TCCCGCTCCC 
TTGCCCAAAA 
CGGCTTGCCG 
AGGCGACGGA 
CGGGCAATGC 
ATCATCGGCG 
CATAAGCGAA 
CCACCGAAAA 
CTCAAAGACT 
CAATGCCGCA 
TCCCCATCAA 
ATCACGGCAC 
GAAAGGGAAA 
AATACCCGTC 
CGTTACGGCA 
CAAAAATGTC 
TTGACGGTAA 
GGATCCGGAG 
CACTGTGGCC 
TCAAAGCTGG 
AAAAAAGACG 
GGGTCTGAAA 
AACAAAACGT 
T T AAC AAC C A 
CGCTCTGGAT 
CGACATTTGC 
TTAGAAGCCG 
TATCGCCGAT 
AAACCGCCAA 
GATGCCAAAG 
CGCTGGCACA 
AAGTTACCGA 
AAAAAAGCAA 
ATTTGTCAGA 



TCGACCGTCA 
AGGGGGGAAC 
AAGCCATCAG 
ATATCGGCTA 
CCCTTCGACA 
CGTTGACGGA 
ATCCCGCCGA 
AAAGGCGCGA 
TATCCGCCTC 
ACCGTTTCCA 
TTCAAACGCG 
CGCCGAAGCC 
CGGCAGGAGA 
GGCTCAAACA 
CAAGATGGCG 
ATGCCGCAGC 
CAAGGCATAG 
AG GG AT T G G A 
ATCCTATCAA 
TCCGCCGTCA 
CCCTTACCAT 
AAGAAAACAT 
AAACTGGCAG 
AGGGTTTCCG 
GAGGAGGAGC 
ATTGCTGCTG 
AG AG AC CAT C 
CAACTGCAGC 
AAAGTCGTGA 
CGATGCCAAA 
AGTTAGCAGA 
GCAACCACCA 
TGAAGAGACT 
TGGCTGATAC 
TCATTGGATG 
TGAAGCCAAA 
TAAAAGCTGC 
GCTAATACTG 
CAT C AAAG CT 
ACAGTGCCGA 
ATTGATGGTC 
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2051 
2101 
2151 
2201 
2251 
2301 
2351 
2401 



TGAACGCTAC 
TCCATTGCCG 
AGACCTGCGC 
CCGGTCTGTT 
GTCGGCGGCT 
CTTTACCGAA 
CCGGTTCTTC 
CACCACCACC 



TACCGAAAAA 
ATCACGATAC 
AAAGAAACCC 
CCAACCTTAC 
ACAAATCCGA 
AACTTTGCCG 
CGCAGCCTAC 
ACCACCACTG 



TTGGACACAC 
TCGCCTGAAC 
GCCAAGGCCT 
AACGTGGGTC 
ATCGGCAGTC 
C C AAAG C AG G 
CATGTCGGCG 
A 



GCTTGGCTTC 
GGTTTGGATA 
TGCAGAACAA 
GGTTCAATGT 
GCCATCGGTA 
CGTGGCAGTC 
TCAATTACGA 



TGCTGAAAAA 
AAACAGTGTC 
GCCGCGCTCT 
AACGGCTGCA 
CCGGCTTCCG 
GGCACTTCGT 
GTGGCTCGAG 
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1 MSDLANDSFI RQVLDRQHFE PDGKYHLFGS RGELAERSGH IGLGKIQSHQ 

51 LGNLMIQQAA IKGNIGYIVR FSDHGHEVHS PFDNHASHSD SDEAGSPVDG 

101 FSLYRIHWDG YEHHPADGYD GPQGGGYPAP KGARDIYSYD IKGVAQNIRL 

151 NLTDNRSTGQ RLADRFHNAG SMLTQGVGDG FKRATRYSPE LDRSGNAAEA 

201 FNGTADIVKN IIGAAGEIVG AGDAVQGISE GSNIAVMHGL GLLSTENKMA 

251 RINDLADMAQ LKDYAAAAIR DWAVQNPNAA QGIEAVSNIF MAAIPIKGIG 

301 AVRGKYGLGG ITAHPIKRSQ MGAIALPKGK SAVSDNFADA AYAKYPSPYH 

351 SRNIRSNLEQ RYGKENITSS TVPPSNGKNV KLADQRHPKT GVPFDGKGFP 

4 01 NFEKHVKYDT GSGGGGATND DDVKKAATVA IAAAYNNGQE INGFKAGETI 

10 451 YDIDEDGTIT KKDATAADVE ADDFKGLGLK KVVTNLTKTV NENKQNVDAK 

501 VKAAESEIEK LTTKLADTDA ALADTDAALD ATTNALNKLG ENITTFAEET 

551 KTNIVKIDEK LEAVADTVDK HAEAFNDIAD SLDETNTKAD EAVKTANEAK 

601 QTAEETKQNV DAKVKAAETA AGKAEAAAGT ANTAADKAEA VAAKVTDIKA 

651 DIATNKDNIA KKANSADVYT REESDSKFVR IDGLNATTEK LDTRLASAEK 

7 01 SIADHDTRLN GLDKTVSDLR KETRQGLAEQ AALSGLFQPY NVGRFNVTAA 

15 751 VGGYKSESAV AIGTGFRFTE NFAAKAGVAV GTSSGSSAAY HVGVNYEWLE 

801 HHHHHH* 
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20 ORF46. l-961c 

1 ATGTCAGATT TGGCAAACGA TTCTTTTATC CGGCAGGTTC TCGACCGTCA 

51 GCATTTCGAA CCCGACGGGA AATACCACCT ATTCGGCAGC AGGGGGGAAC 

101 TTGCCGAGCG CAGCGGCCAT ATCGGATTGG GAAAAATACA AAG C CAT C AG 

151 TTGGGCAACC TGATGATTCA ACAGGCGGCC ATTAAAGGAA ATATCGGCTA 

201 CATTGTCCGC TTTTCCGATC ACGGGCACGA AGTCCATTCC CCCTTCGACA 

25 251 ACCATGCCTC ACATTCCGAT TCTGATGAAG CCGGTAGTCC CGTTGACGGA 

301 TTTAGCCTTT ACCGCATCCA TTGGGACGGA TACGAACACC ATCCCGCCGA 

351 CGGCTATGAC GGGCCACAGG GCGGCGGCTA TCCCGCTCCC AAAGGCGCGA 

401 GGGATATATA CAGCTACGAC ATAAAAGGCG TTGCCCAAAA TATCCGCCTC 

4 51 AACCTGACCG ACAACCGCAG CACCGGACAA CGGCTTGCCG ACCGTTTCCA 

501 CAATGCCGGT AGTATGCTGA CGCAAGGAGT AGGCGACGGA TTCAAACGCG 

551 CCACCCGATA CAGCCCCGAG CTGGACAGAT CGGGCAATGC CGCCGAAGCC 

601 TTCAACGGCA CTGCAGATAT CGTTAAAAAC ATCATCGGCG CGGCAGGAGA 

651 AATTGTCGGC GCAGGCGATG CCGTGCAGGG CATAAGCGAA GGCTCAAACA 

701 TTGCTGTCAT GCACGGCTTG GGTCTGCTTT CCACCGAAAA C AAG AT GG C G 

7 51 CGCATCAACG ATTTGGCAGA TATGGCGCAA CTCAAAGACT ATGCCGCAGC 

8 01 AGCCATCCGC GATTGGGCAG TCCAAAACCC CAATGCCGCA CAAGGCATAG 
8 51 AAGCCGTCAG CAATATCTTT ATGGCAGCCA TCCCCATCAA AGGGATTGGA 
901 GCTGTTCGGG GAAAATACGG CTTGGGCGGC ATCACGGCAC ATCCTATCAA 
951 GCGGTCGCAG ATGGGCGCGA TCGCATTGCC GAAAGGGAAA TCCGCCGTCA 

1001 GCGACAATTT TGCCGATGCG GCATACGCCA AATACCCGTC CCCTTACCAT 

1051 TCCCGAAATA TCCGTTCAAA CTTGGAGCAG CGTTACGGCA AAGAAAACAT 

40 1101 CACCTCCTCA ACCGTGCCGC CGTCAAACGG CAAAAATGTC AAACTGGCAG 

1151 ACCAACGCCA CCCGAAGACA GGCGTACCGT TTGACGGTAA AGGGTTTCCG 

1201 AATTTTGAGA AGCACGTGAA ATATGATACG GGATCCGGAG GAGGAGGAGC 

1251 CACAAACGAC GACGATGTTA AAAAAGCTGC CACTGTGGCC ATTGCTGCTG 

1301 CCTACAACAA TGGCCAAGAA AT CAACGGTT TCAAAGCTGG AG AG AC CATC 

1351 TACGACATTG ATGAAGACGG CACAATTACC AAAAAAGACG CAACTGCAGC 

45 1401 CGATGTTGAA GCCGACGACT TTAAAGGTCT GGGTCTGAAA AAAGTCGTGA 

14 51 CTAACCTGAC CAAAACCGTC AATGAAAACA AACAAAACGT CGATGCCAAA 

1501 GTAAAAGCTG CAGAATCTGA AATAGAAAAG TTAACAACCA AGTTAGCAGA 

1551 CACTGATGCC GCTTTAGCAG ATACTGATGC CGCTCTGGAT GCAACCACCA 

1601 ACGCCTTGAA TAAATTGGGA GAAAATATAA CGACATTTGC TGAAGAGACT 

1651 AAG AC AAAT A TCGTAAAAAT TGATGAAAAA TTAGAAGCCG TGGCTGATAC 

1701 CGTCGACAAG CATGCCGAAG CATTCAACGA TATCGCCGAT TCATTGGATG 

17 51 AAACCAACAC TAAGGCAGAC GAAGCCGTCA AAACCGCCAA TGAAGCCAAA 

1801 CAGACGGCCG AAGAAACCAA ACAAAACGTC GATGCCAAAG TAAAAGCTGC 

1851 AGAAACTGCA GCAGGCAAAG CCGAAGCTGC CGCTGGCACA GCTAATACTG 
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CAGCCGACAA GGCCGAAGCT GTCGCTGCAA AAGTTACCGA CATCAAAGCT 
GATATCGCTA CGAACAAAGA TAATATTGCT AAAAAAGCAA ACAGTGCCGA 
CGTGTACACC AGAGAAGAGT CTGACAGCAA ATTTGTCAGA ATTGATGGTC 
TGAACGCTAC TACCGAAAAA TTGGACACAC GCTTGGCTTC TGCTGAAAAA 
TCCATTGCCG ATCACGATAC TCGCCTGAAC GGTTTGGATA AAACAGTGTC 
AGACCTGCGC AAAGAAACCC GCCAAGGCCT TGCAGAACAA GCCGCGCTCT 
CCGGTCTGTT CCAACCTTAC AACGTGGGTC TCGAGCACCA CCACCACCAC 
CACTGA 



MSDLANDSFI 
LGNLMIQQAA 
FSLYRIHWDG 
NLTDNRSTGQ 
FNGTADIVKN 
RINDLADMAQ 
AVRGKYGLGG 
SRNIRSNLEQ 
NFEKHVKYDT 
YDIDEDGTIT 
VKAAESEIEK 
KTNIVKIDEK 
QTAEETKQNV 
DIATNKDNIA 
SIADHDTRLN 
H* 



RQVLDRQHFE 
IKGNIGYIVR 
YEHHPADGYD 
RLADRFHNAG 
IIGAAGEIVG 
LKDYAAAAIR 
ITAHPIKRSQ 
RYGKENITSS 
GSGGGGATND 
KKDATAADVE 
LTTKLADTDA 
LEAVADTVDK 
DAKVKAAETA 
KKANSADVYT 
GLDKTVSDLR 



PDGKYHLFGS 
FSDHGHEVHS 
GPQGGGYPAP 
SMLTQGVGDG 
AGDAVQGISE 
DWAVQNPNAA 
MGAIALPKGK 
TVPPSNGKNV 
DDVKKAATVA 
ADDFKGLGLK 
ALADT DAALD 
HAEAFNDIAD 
AGKAEAAAGT 
REESDSKFVR 
KETRQGLAEQ 



RGELAERSGH 
PFDNHASHSD 
KGARDIYSYD 
FKRATRYSPE 
GSNIAVMHGL 
QGIEAVSNIF 
SAVSDNFADA 
KLADQRHPKT 
IAAAYNNGQE 
KVVTNLTKTV 
ATTNALNKLG 
SLDETNTKAD 
ANTAADKAEA 
IDGLNATTEK 
AALSGLFQPY 



IGLGKIQSHQ 
SDEAGSPVDG 
IKGVAQNIRL 
LDRSGNAAEA 
GLLSTENKMA 
MAAIPIKGIG 
AYAKYPSPYH 
GVPFDGKGFP 
INGFKAGETI 
NENKQNVDAK 
ENITTFAEET 
EAVKTANEAK 
VAAKVTDIKA 
LDTRLASAEK 
NVGLEHHHHH 
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961-ORF46.1 

1 ATGGCCACAA ACGACGACGA TGTTAAAAAA GCTGCCACTG TGGCCATTGC 

51 TGCTGCCTAC AACAATGGCC AAGAAATCAA CGGTTTCAAA GCTGGAGAGA 

5 101 CCATCTACGA CATTGATGAA GACGGCACAA TTACCAAAAA AG AC G C AAC T 

151 GCAGCCGATG TTGAAGCCGA CGACTTTAAA GGTCTGGGTC TGAAAAAAGT 

201 CGTGACTAAC CTGACCAAAA CCGTCAATGA AAACAAACAA AACGTCGATG 

251 CCAAAGTAAA AGCTGCAGAA TCTGAAATAG AAAAGTTAAC AACCAAGTTA 

301 GCAGACACTG ATGCCGCTTT AG C AG AT AC T GATGCCGCTC TGGATGCAAC 

351 CACCAACGCC TTGAATAAAT TGGGAGAAAA TATAACGACA TTTGCTGAAG 

10 4 01 AG AC T AAG AC AAATATCGTA AAAATTGATG AAAAATTAGA AGCCGTGGCT 

4 51 GATACCGTCG ACAAGCATGC CGAAGCATTC AACGATATCG CCGATTCATT 

501 GGATGAAACC AACACTAAGG CAGACGAAGC CGTCAAAACC GCCAATGAAG 

551 CCAAACAGAC GGCCGAAGAA ACCAAACAAA ACGTCGATGC CAAAGTAAAA 

601 GCTGCAGAAA CTGCAGCAGG CAAAGCCGAA GCTGCCGCTG GCACAGCTAA 

651 TACTGCAGCC GACAAGGCCG AAGCTGTCGC TGCAAAAGTT ACCGACATCA 

75 7 01 AAGCTGATAT CGCTACGAAC AAAG AT AAT A TTGCTAAAAA AGCAAACAGT 

7 51 GCCGACGTGT ACACCAGAGA AGAGTCTGAC AGCAAATTTG TCAGAATTGA 

801 TGGTCTGAAC GCTACTACCG AAAAATTGGA CACACGCTTG GCTTCTGCTG 

851 AAAAATCCAT TGCCGATCAC GATACTCGCC TGAACGGTTT GGATAAAACA 

901 GTGTCAGACC TGCGCAAAGA AACCCGCCAA GGCCTTGCAG AACAAGCCGC 

951 GCTCTCCGGT CTGTTCCAAC CTTACAACGT GGGTCGGTTC AATGTAACGG 

1001 CTGCAGTCGG CGGCTACAAA TCCGAATCGG CAGTCGCCAT CGGTACCGGC 

1051 TTCCGCTTTA CCGAAAACTT TGCCGCCAAA GCAGGCGTGG CAGTCGGCAC 

1101 TTCGTCCGGT TCTTCCGCAG CCTACCATGT CGGCGTCAAT TACGAGTGGG 

1151 GATCCGGAGG AGGAGG AT C A GATTTGGCAA ACGATTCTTT TATCCGGCAG 

1201 GTTCTCGACC GTCAGCATTT CGAACCCGAC GGGAAATACC ACCTATTCGG 

1251 CAGCAGGGGG GAACTTGCCG AGCGCAGCGG CCATATCGGA TTGGGAAAAA 

1301 TACAAAGCCA TCAGTTGGGC AACCTGATGA TTCAACAGGC GGCCATTAAA 

1351 GGAAATATCG GCTACATTGT CCGCTTTTCC GATCACGGGC ACGAAGTCCA 

14 01 TTCCCCCTTC GACAACCATG CCTCACATTC CGATTCTGAT GAAGCCGGTA 

14 51 GTCCCGTTGA CGGATTTAGC CTTTACCGCA TCCATTGGGA CGGATACGAA 

1501 CACCATCCCG CCGACGGCTA TGACGGGCCA CAGGGCGGCG GCTATCCCGC 

30 1551 TCCCAAAGGC GCGAGGGATA TATACAGCTA CGACATAAAA GGCGTTGCCC 

1601 AAAATATCCG CCTCAACCTG ACCGACAACC GCAGCACCGG ACAACGGCTT 

1651 GCCGACCGTT TCCACAATGC CGGTAGTATG CTGACGCAAG GAGTAGGCGA 

1701 CGGATTCAAA CGCGCCACCC GATACAGCCC CGAGCTGGAC AGATCGGGCA 

1751 ATGCCGCCGA AGCCTTCAAC GGCACTGCAG ATATCGTTAA AAACATCATC 

1801 GGCGCGGCAG GAGAAATTGT CGGCGCAGGC GATGCCGTGC AGGGCATAAG 

35 18 51 CGAAGGCTCA AACATTGCTG TCATGCACGG CTTGGGTCTG CTTTCCACCG 

1901 AAAACAAGAT GGCGCGCATC AACGATTTGG CAGATATGGC GCAACTCAAA 



20 



25 



1951 GACTATGCCG CAGCAGCCAT CCGCGATTGG GCAGTCCAAA ACCCCAATGC 

2001 CGCACAAGGC ATAGAAGCCG TCAGCAATAT CTTTATGGCA GCCATCCCCA 

2051 TCAAAGGGAT TGGAGCTGTT CGGGGAAAAT ACGGCTTGGG CGGCATCACG 

2101 GCACATCCTA TCAAGCGGTC GCAGATGGGC GCGATCGCAT TGCCGAAAGG 

2151 GAAATCCGCC GTCAGCGACA ATTTTGCCGA TGCGGCATAC GCCAAATACC 

2201 CGTCCCCTTA CCATTCCCGA AATATCCGTT CAAACTTGGA GCAGCGTTAC 

2251 GGCAAAGAAA ACATCACCTC CTCAACCGTG CCGCCGTCAA ACGGCAAAAA 

2301 TGTCAAACTG GCAGACCAAC GCCACCCGAA GACAGGCGTA CCGTTTGACG 

2351 GTAAAGGGTT TCCGAATTTT GAGAAGCACG TGAAATATGA TACGCTCGAG 

24 01 CACCACCACC ACCACCACTG A 
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MATNDDDVKK 
AADVEADDFK 
ADTDAALADT 
DTVDKHAEAF 
AAETAAGKAE 
ADVYTREESD 
VSDLRKETRQ 
FRFTENFAAK 
VLDRQHFEPD 
GNIGYIVRFS 
HHPADGYDGP 
ADRFHNAGSM 
GAAGEIVGAG 
DYAAAAIRDW 
AHPIKRSQMG 
GKENITSSTV 
HHHHHH* 



AAT V A I AAA Y 
GLGLKKVVTN 
DAALDATTNA 
NDIADSLDET 
AAAGTANTAA 
SKFVRIDGLN 
GLAEQAALSG 
AGVAVGTSSG 
GKYHLFGSRG 
DHGHEVHSPF 
QGGGYPAPKG 
LTQGVGDGFK 
DAVQGISEGS 
AVQN PNAAQG 
AIALPKGKSA 
PPSNGKNVKL 



NNGQEINGFK 
LTKTVNENKQ 
LNKLGENITT 
NTKADEAVKT 
DKAEAVAAKV 
ATTEKLDTRL 
LFQPYNVGRF 
SSAAYHVGVN 
ELAERSGHIG 
DNHASHSDSD 
ARDI YSYDIK 
RATRYSPELD 
NIAVMHGLGL 
IEAVSNI FMA 
VSDNFADAAY 
ADQRHPKTGV 



AGETIYDIDE 
NVDAKVKAAE 
FAEETKTNIV 
ANEAKQTAEE 
TDIKADIATN 
ASAEKSIADH 
NVTAAVGGYK 
YEWGSGGGGS 
LGKIQSHQLG 
EAGSPVDGFS 
GVAQNIRLNL 
RSGNAAEAFN 
LSTENKMARI 
AIPIKGIGAV 
AKYPSPYHSR 
PFDGKGFPNF 



DGTITKKDAT 
SEIEKLTTKL 
KIDEKLEAVA 
TKQNVDAKVK 
KDNIAKKANS 
DTRLNGLDKT 
SESAVAIGTG 
DLANDSFIRQ 
NLMIQQAAIK 
LYRIHWDGYE 
TDNRSTGQRL 
GTADIVKNII 
NDLADMAQLK 
RGKYGLGGIT 
NIRSNLEQRY 
EKHVKYDTLE 



ATGGCCACAA 
TGCTGCCTAC 
CCATCTACGA 
GCAGCCGATG 
CGTGACTAAC 
CCAAAGTAAA 
GCAGACACTG 
CACCAACGCC 
AGACTAAGAC 
GATACCGTCG 
GGATGAAACC 
CCAAACAGAC 
GCTGCAGAAA 
TACTGCAGCC 
AAGCTGATAT 
GCCGACGTGT 
TGGTCTGAAC 
AAAAAT C CAT 
GTGTCAGACC 
GCTCTCCGGT 
CTGCAGTCGG 
TTCCGCTTTA 
TTCGTCCGGT 
GATCCGGAGG 
GCACTAACCG 
GCTGGATCAG 
GTGCGGAAAA 
AAGAACGACA 
CGGGCAGCTC 
GCCATTCCGC 
CATTCCGGGA 
GGGCGAACAT 
ATCGCGGGAC 
ACCATAGATT 
ATCGCCAGAA 
GAAAACGCCA 
AAAGGCAGTT 



ACGACGACGA 
AACAATGGCC 
CATTGATGAA 
TTGAAGCCGA 
CTGACCAAAA 
AGCTGCAGAA 
ATGCCGCTTT 
TTGAATAAAT 
AAATATCGTA 
ACAAGCATGC 
AACACTAAGG 
GGCCGAAGAA 
CTGCAGCAGG 
GACAAGGCCG 
CGCTACGAAC 
ACACCAGAGA 
GCTACTACCG 
TGCCGATCAC 
TGCGCAAAGA 
CTGTTCCAAC 
CGGCTACAAA 
CCGAAAACTT 
TCTTCCGCAG 
GGGTGGTGTC 
CACCGCTCGA 
TCCGTCAGGA 
AACTTATGGA 
AGGTCAGCCG 
ATTACCTTGG 
CTTAACCGCC 
AGATGGTTGC 
ACATCTTTTG 
GGCGTTCGGT 
TCGCCGCCAA 
CTCAATGTCG 
TGCCGTCATC 
ACTCCCTCGG 



TGTTAAAAAA 
AAGAAATCAA 
GACGGCACAA 
CGACTTTAAA 
CCGTCAATGA 
TCTGAAATAG 
AGCAGATACT 
TGGGAGAAAA 
AAAATTGATG 
CGAAGCATTC 
CAGACGAAGC 
ACCAAACAAA 
CAAAGCCGAA 
AAGCTGTCGC 
AAAGATAATA 
AGAGTCTGAC 
AAAAATTGGA 
GATACTCGCC 
AACCCGCCAA 
CTTACAACGT 
TCCGAATCGG 
TGCCGCCAAA 
CCTACCATGT 
GCCGCCGACA 
CCATAAAGAC 
AAAACGAGAA 
AACGGTGACA 
TTTCGACTTT 
AGAGTGGAGA 
TTTCAGACCG 
GAAACGCCAG 
ACAAGCTTCC 
TCAGACGATG 
GCAGGGAAAC 
ACCTGGCCGC 
AGCGGTTCCG 
TATCTTTGGC 



GCTGCCACTG 
CGGTTTCAAA 
TTACCAAAAA 
GGTCTGGGTC 
AAACAAACAA 
AAAAGTTAAC 
GATGCCGCTC 
TATAACGACA 
AAAAAT TAG A 
AACGATATCG 
CGTCAAAACC 
ACGTCGATGC 
GCTGCCGCTG 
TGCAAAAGTT 
TTGCTAAAAA 
AGCAAATTTG 
CACACGCTTG 
TGAACGGTTT 
GGCCTTGCAG 
GGGTCGGTTC 
CAGTCGCCAT 
GCAGGCGTGG 
CGGCGTCAAT 
TCGGTGCGGG 
AAAGGTTTGC 
ACTGAAGCTG 
GCCTCAATAC 
ATCCGCCAAA 
GTTCCAAGTA 
AG C AAAT AC A 
TTCAGAATCG 
CGAAGGCGGC 
CCGGCGGAAA 
GGCAAAATCG 
CGCCGATATC 
TCCTTTACAA 
GGAAAAGCCC 



TGGCCATTGC 
GCTGGAGAGA 
AGACGCAACT 
TGAAAAAAGT 
AACGTCGATG 
AACCAAGTTA 
TGGATGCAAC 
TTTGCTGAAG 
AGCCGTGGCT 
CCGATTCATT 
GCCAATGAAG 
CAAAGTAAAA 
GCACAGCTAA 
ACCGACATCA 
AGCAAACAGT 
T C AG AAT T G A 
GCTTCTGCTG 
GGATAAAACA 
AACAAGCCGC 
AATGTAACGG 
CGGTACCGGC 
CAGTCGGCAC 
TACGAGTGGG 
GCTTGCCGAT 
AGTCTTTGAC 
GCGGCACAAG 
GGGCAAATTG 
TCGAAGTGGA 
TACAAACAAA 
AGATTCGGAG 
GCGACATAGC 
AGGGCGACAT 
ACTGACCTAC 
AACATTTGAA 
AAGCCGGATG 
CCAAGCCGAG 
AGGAAGTTGC 



CGGCAGCGCG GAAGTGAAAA CCGTAAACGG CATACGCCAT ATCGGCCTTG 
CCGCCAAGCA ACTCGAGCAC CACCACCACC ACCACTGA 
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MATNDDDVKK 
AADVEADDFK 
ADTDAALADT 
DTVDKHAEAF 
AAETAAGKAE 
ADVYTREESD 
VSDLRKETRQ 
FRFTENFAAK 
ALTAPLDHKD 
KNDKVSRFDF 
HSGKMVAKRQ 
TIDFAAKQGN 
KGSYSLGIFG 



AAT V A I AAA Y 
GLGLKKVVTN 
DAALDATTNA 
NDIADSLDET 
AAAGTANTAA 
SKFVRIDGLN 
GLAEQAALSG 
AGVAVGTSSG 
KGLQSLTLDQ 
IRQIEVDGQL 
FRIGDIAGEH 
GKIEHLKSPE 
GKAQEVAGSA 



NNGQEINGFK 
LTKTVNENKQ 
LNKLGENITT 
NTKADEAVKT 
DKAEAVAAKV 
ATTEKLDTRL 
LFQPYNVGRF 
SSAAYHVGVN 
SVRKNEKLKL 
ITLESGEFQV 
TSFDKLPEGG 
LNVDLAAADI 
EVKTVNGIRH 



AGETIYDIDE 
NVDAKVKAAE 
FAEETKTNIV 
ANEAKQTAEE 
TDIKADIATN 
ASAEKSIADH 
NVTAAVGGYK 
YEWGSGGGGV 
AAQGAEKTYG 
YKQSHSALTA 
RATYRGTAFG 
KPDGKRHAVI 
IGLAAKQLEH 



DGTITKKDAT 
SEIEKLTTKL 
KIDEKLEAVA 
TKQNVDAKVK 
KDNIAKKANS 
DTRLNGLDKT 
SESAVAIGTG 
AADIGAGLAD 
NGDSLNTGKL 
FQTEQIQDSE 
SDDAGGKLTY 
SGSVLYNQAE 
HHHHH* 
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961-983 



10 



15 



20 



25 



30 



35 



40 



1 


ATGGCCACAA 


51 


TGCTGCCTAC 


101 


CCATCTACGA 


151 


GCAGCCGATG 


201 


CGTGACTAAC 


251 


CCAAAGTAAA 


301 


GCAGACACTG 


351 


CACCAACGCC 


401 


AGACTAAGAC 


451 


GATACCGTCG 


501 


GGATGAAACC 


551 


C C AAAC AG AC 


601 


GCTGCAGAAA 


651 


TACTGCAGCC 


701 


AAGCTGATAT 


751 


GCCGACGTGT 


801 


TGGTCTGAAC 


851 


AAAAATCCAT 


901 


GTGTCAGACC 


951 


GCTCTCCGGT 


1001 


CTGCAGTCGG 


1051 


TTCCGCTTTA 


1101 


TTCGTCCGGT 


1151 


GATCCGGCGG 


1201 


GGTATCGGCA 


1251 


TTACGCCGGT 


1301 


CCGGTCGGGA 


1351 


CCCCCCCCGA 


1401 


CAAGAATTTG 


1451 


GCGGGGTAGA 


1501 


TCCTTTCCCG 


1551 


CAAAAACTAT 


1601 


GTAAAGACAT 


1651 


GCAAAGCCGA 


1701 


GGTCTCCCAT 


1751 


GTATTGCGCC 


1801 


AAG AAC G AAA 


1851 


CGAACGTGGC 


1901 


CAGGCACTGC 


1951 


CAAGCGTTGC 


2001 


CCTGATGCAA 


2051 


AAAACATGCT 


2101 


AAC AC AT AT G 


2151 


TATCACAGTC 


2201 


TGTATGGAGA 


2251 


GGAATTACTG 


2301 


TTTCACCCGT 


2351 


CCATCGTAAC 



ACGACGACGA 
AACAATGGCC 
CATTGATGAA 
TTGAAGCCGA 
CTGACCAAAA 
AGCTGCAGAA 
ATGCCGCTTT 
TTGAATAAAT 
AAATATCGTA 
ACAAGCATGC 
AACACTAAGG 
GGCCGAAGAA 
CTGCAGCAGG 
GACAAGGCCG 
CGCTACGAAC 
ACACCAGAGA 
GCTACTACCG 
TGCCGATCAC 
TGCGCAAAGA 
CTGTTCCAAC 
CGGCTACAAA 
CCGAAAACTT 
TCTTCCGCAG 
AGGCGGCACT 
GC AAC AG C AG 
ATCAAGAACG 
TGACGTTGCG 
ATCTGCATAC 
AT C AAC C T C A 
GGTAGGTATC 
AACTGTATGG 
ACGGCGTATA 
TGAAGCTTCT 
CGGATATCCG 
ATTATTGGCG 
CGATGCGACG 
TGATGGTTGC 
GTGCGCATCG 
CGACCTTTTC 
TCGACTATTC 
CAGAGCGATT 
TTTCATCTTT 
CCCTATTGCC 
GCAGGCGTAG 
ACCGGGTACA 
CCATGTGGTG 
AC AAAC C C G A 
CGGCACGGCG 



TGTTAAAAAA 
AAGAAATCAA 
GACGGCACAA 
CGACTTTAAA 
CCGTCAATGA 
TCTGAAATAG 
AGCAGATACT 
TGGGAGAAAA 
AAAATTGATG 
CGAAGCATTC 
CAGACGAAGC 
ACCAAACAAA 
CAAAGCCGAA 
AAGCTGTCGC 
AAAGATAATA 
AG AG T C T G AC 
AAAAATTGGA 
GATACTCGCC 
AACCCGCCAA 
CTTACAACGT 
TCCGAATCGG 
TGCCGCCAAA 
CCTACCATGT 
TCTGCGCCCG 
AG C AAC AAC A 
AAATGTGCAA 
GTTACAGACA 
CGGAGACTTT 
AACCTGCAAT 
GTCGACACAG 
CAGAAAAGAA 
TGCGGAAGGA 
TTCGACGATG 
CCACGTAAAA 
GGCGTTCCGT 
CTACACATAA 
AGCCATCCGC 
TCAATAACAG 
CAAATAGCCA 
CGGCGGTGAT 
ACGGCAACCT 
TCGACAGGCA 
AT T T TAT G AA 
ACCGCAGTGG 
GAACCGCTTG 
CCTGTCGGCA 
TTCAAATTGC 
GCTCTGCTGC 



GCTGCCACTG 
CGGTTTCAAA 
TTACCAAAAA 
GGTCTGGGTC 
AAACAAACAA 
AAAAGT T AAC 
GATGCCGCTC 
TATAACGACA 
AAAAAT TAG A 
AAC GAT AT CG 
CGTCAAAACC 
ACGTCGATGC 
GCTGCCGCTG 
TGCAAAAGTT 
TTGCTAAAAA 
AGCAAATTTG 
CACACGCTTG 
TGAACGGTTT 
GGCCTTGCAG 
GGGTCGGTTC 
CAGTCGCCAT 
GCAGGCGTGG 
CGGCGTCAAT 
ACTT CAATGC 
GCGAAATCAG 
AG AC AG AAG C 
GGGATGCCAA 
CCAAACCCAA 
TGAAGCAGGC 
GCGAATCCGT 
CACGGCTATA 
AGCGCCTGAA 
AGGCCGTTAT 
GAAATCGGAC 
GGACGGCAGA 
TGAATACGAA 
AATGCATGGG 
TTTT GGAACA 
ATTCGGAGGA 
AAAACAGACG 
GTCCTACCAC 
ATGACGCACA 
AAAGACGCTC 
AGAAAAGTTC 
AGTATGGCTC 
CCCTATGAAG 
CGGAACATCC 
TGCAGAAATA 



TGGCCATTGC 
GCTGGAGAGA 
AGACGCAACT 
TGAAAAAAGT 
AACGTCGATG 
AACCAAGTTA 
TGGATGCAAC 
TTTGCTGAAG 
AGCCGTGGCT 
CCGATTCATT 
GCCAATGAAG 
CAAAGTAAAA 
GCACAGCTAA 
ACCGACATCA 
AGCAAACAGT 
TCAGAATTGA 
GCTTCTGCTG 
GGATAAAACA 
AACAAGCCGC 
AATGTAACGG 
CGGTACCGGC 
CAGTCGGCAC 
TACGAGTGGG 
AGGCGGTACC 
C AG C AG TAT C 
ATGCTCTGTG 
AATCAATGCC 
AT G AC G CAT A 
TATACAGGAC 
CGGCAGCATA 
ACGAAAATTA 
GACGGAGGCG 
AGAGACTGAA 
ACATCGATTT 
CCTGCAGGCG 
TGATGAAACC 
TCAAGCTGGG 
ACATCGAGGG 
GCAGTACCGC 
AGGGTATCCG 
ATCCGTAATA 
AGCTCAGCCC 
AAAAAG G C AT 
AAACGGGAAA 
CAACCATTGC 
CAAGCGTCCG 
TTTTCCGCAC 
CCCGTGGATG 
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10 



15 



20 



25 



30 



35 



2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 
3451 
3501 
3551 
3601 
3651 
3701 
3751 
3801 
3851 
3901 
3951 
4001 
4051 
4101 
4151 
4201 
4251 
4301 



AGCAACGACA 
TGCAGTCGGC 
AGGCCATGAA 
ACGAAAGGTA 
CACGGGCGGC 
ACAACACCTA 
TACGGCAACA 
TTATAACGGG 
ATCTGGCAGA 
GGCAGTCTGC 
ACTGCTGAAA 
CGGCACGCGG 
CCCTTCCTGA 
CAT C G AAACC 
CAGCGGGCAG 
GCGGCACGGA 
ACACGCCGTA 
TGGATGCCTC 
GCCGACCGCA 
CGCAGCGGCA 
TCAACAGTCT 
GATATGCAGG 
CGGCACGGGT 
GGGAACAGGG 
GGCATTGCCG 
CATGGGACGC 
GCATTAGTCT 
CTCAAAGGCC 
CACCGGTGCG 
AGCTGGGCGC 
TTGACGGTCG 
CGCCGAAAAA 
GCACGCTGGT 
AAAGCCGTCC 
CGACTACACG 
AGACGGGGGC 
GCGGATGTCG 
CGCCGGTTCC 
ACCGGTTCCT 



ACCTGCGTAC 
GTGGACAGCA 
CGGACCCGCG 
CATCCGATAT 
CTGATCAAAA 
TACGGGCAAA 
ACAAATCGGA 
GCGGCATCCG 
TACCGACCAA 
AGCTGGACGG 
GTGGACGGTA 
CAAGGGGGCA 
GTGCCGCCAA 
GACGGCGGCC 
TGAAGGCGAC 
CTGCTTCGGC 
GAACAGGGCG 
CGAATCATCC 
CAGATATGCC 
GCCGTACAGC 
CGCCGCTACC 
GACGCCGCCT 
CTGCGCGTCA 
CGGTGTTGAA 
CGAAAACCGG 
AGCACATGGA 
GTTTGCAGGC 
TGTTCTCCTA 
GACGAACATG 
ACTGGGCGGT 
AAGGCGGTCT 
GGCAGTGCTT 
CGGACTCGCG 
TGTTTGCAAC 
GTAACGGGCG 
ACGCAATATG 
AATTCGGCAA 
AAACAGTACG 
CGAGCACCAC 



CACGTTGCTG 
AGTTCGGCTG 
TCCTTTCCGT 
TGCCTACTCC 
AAGGCGGCAG 
AC CAT TAT CG 
TATGCGCGTC 
GCGGCAGCCT 
TCCGGCGCAA 
CAAAGGTACG 
CGGCGATTAT 
GGCTATCTCA 
AATCGGGCAG 
TGCTGGCTTC 
ACGCTGTCCT 
AGCGGCACAT 
GCAGCAATCT 
GCAACACCCG 
GGGCATCCGC 
ATGCGAATGC 
GTCTATGCCG 
GAAAGCCGTA 
TCGCGCAAAC 
GGCAAAATGC 
CGAAAATACG 
GCGAAAACAG 
ATACGGCACG 
CGGACGCTAC 
CGGAAGGCAG 
GTCAACGTTC 
GCGCTACGAC 
TGGGCTGGAG 
GGTCTGAAGC 
GGCGGGCGTG 
GCTTTACCGG 
CCGCACACCC 
CGGCTGGAAC 
GCAACCACAG 
CACCACCACC 



ACGACGGCTC 
GGGACTGCTG 
TCGGCGACTT 
TTCCGTAACG 
CCAACTGCAA 
AAGGCGGTTC 
GAAACCAAAG 
GAACAGCGAC 
ACGAAACCGT 
CTGTACACAC 
CGGCGGCAAG 
ACAGTACCGG 
GATTATTCTT 
CCTCGACAGC 
ATTATGTCCG 
TCCGCGCCCG 
GGAAAACCTG 
AGACGGTTGA 
CCCTACGGCG 
CGCCGACGGT 
ACAGTACCGC 
TCGGACGGGT 
CCAACAGGAC 
GCGGCAGTAC 
ACAGCAGCCG 
TGCAAATGCA 
ATGCGGGCGA 
AAAAACAGCA 
CGTCAACGGC 
CGTTTGCCGC 
CTGCTCAAAC 
CGGCAACAGC 
TGTCGCAACC 
GAACGCGACC 
CGCGACTGCA 
GTCTGGTTGC 
GGCTTGGCAC 
CGGACGAGTC 
ACTGA 



AGGACATCGG 
GATGCGGGTA 
TACCGCCGAT 
ACATTTCAGG 
CTGCACGGCA 
GCTGGTGTTG 
GTGCGCTGAT 
GGCATTGTCT 
AC AC AT C AAA 
GTTTGGGCAA 
CTGTACATGT 
ACGACGTGTT 
T CTT C AC AAA 
GTCGAAAAAA 
TCGCGGCAAT 
CCGGTCTGAA 
ATGGTCGAAC 
AACTGCGGCA 
CAACTTTCCG 
GTACGCATCT 
CGCCCATGCC 
TGGACCACAA 
GGTGGAACGT 
CCAAACCGTC 
CCACACTGGG 
AAAACCGACA 
TATCGGCTAT 
TCAGCCGCAG 
ACGCTGATGC 
AACGGGAGAT 
AGGATGCATT 
CTCACTGAAG 
CTTGAGCGAT 
TGAACGGACG 
GCAACCGGCA 
CGGCCTGGGC 
GTTACAGCTA 
GGCGTAGGCT 
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MATNDDDVKK 
AADVEADDFK 
ADT DAALADT 
DTVDKHAEAF 
AAETAAGKAE 
ADVYTREESD 
VSDLRKETRQ 
FRFTENFAAK 
GIGSNSRATT 
PPPNLHTGDF 
SFPELYGRKE 
AKPTDIRHVK 
KNEMMVAAIR 
QALLDYSGGD 
NTYALLPFYE 
GITAMWCLSA 
SNDNLRTTLL 
TKGTSDIAYS 
YGNNKSDMRV 
GSLQLDGKGT 
PFLSAAKIGQ 
AARTASAAAH 
ADRTDMPGIR 
DMQGRRLKAV 
GIAAKTGENT 
LKGLFSYGRY 
LTVEGGLRYD 



AAT V A I AAA Y 
GLGLKKWTN 
DAALDATTNA 
NDIADSLDET 
AAAGTANTAA 
SKFVRIDGLN 
GLAEQAALSG 
AGVAVGTS SG 
AK S AAV S Y AG 
PNPNDAYKNL 
HGYNENYKNY 
EIGHIDLVSH 
NAWVKLGERG 
KTDEGIRLMQ 
KDAQKGIITV 
PYEASVRFTR 
TTAQDIGAVG 
FRNDISGTGG 
ETKGALIYNG 
LYTRLGKLLK 
DYSFFTNIET 
SAPAGLKHAV 
PYGATFRAAA 
SDGLDHNGTG 
TAAATLGMGR 
KNSISRSTGA 
LLKQDAFAEK 



NNGQEINGFK 
LTKTVNENKQ 
LNKLGENITT 
NTKADEAVKT 
DKAEAVAAKV 
ATTEKLDTRL 
LFQPYNVGRF 
SSAAYHVGVN 
IKNEMCKDRS 
INLKPAIEAG 
TAYMRKEAPE 
IIGGRSVDGR 
VRIVNNSFGT 
QSDYGNLSYH 
AGVDRSGEKF 
TNPIQIAGTS 
VDSKFGWGLL 
LIKKGGSQLQ 
AASGGSLNSD 
VDGTAIIGGK 
DGGLLASLDS 
EQGGSNLENL 
AVQHANAADG 
LRVIAQTQQD 
STWSENSANA 
DEHAEGSVNG 
GSALGWSGNS 



AGETIYDIDE 
N V D AKVKAAE 
FAEETKTNIV 
ANEAKQTAEE 
TDIKADIATN 
ASAEKSIADH 
NVTAAVGGYK 
YEWGSGGGGT 
MLCAGRDDVA 
YTGRGVEVGI 
DGGGKDIEAS 
PAGGIAPDAT 
TSRAGTADLF 
IRNKNMLFIF 
KREMYGEPGT 
FSAPIVTGTA 
DAGKAMNGPA 
LHGNNTYTGK 
GIVYLADTDQ 
LYMSARGKGA 
VEKTAGSEGD 
MVELDASESS 
VRIFNSLAAT 
GGTWEQGGVE 
KTDSISLFAG 
TLMQLGALGG 
LTEGTLVGLA 



DGTITKKDAT 
SEIEKLTTKL 
KIDEKLEAVA 
TKQNVDAKVK 
KDNIAKKANS 
DTRLNGLDKT 
SESAVAIGTG 
SAPDFNAGGT 
VTDRDAKINA 
VDTGESVGSI 
FDDEAVIETE 
LHIMNTNDET 
QIANSEEQYR 
STGNDAQAQP 
EPLEYGSNHC 
ALLLQKYPWM 
SFPFGDFTAD 
TIIEGGSLVL 
SGANETVHIK 
GYLNSTGRRV 
TLSYYVRRGN 
ATPETVETAA 
VYADSTAAHA 
GKMRGSTQTV 
IRHDAGDIGY 
VNVPFAATGD 
GLKLSQPLSD 



KAVLFATAGV ERDLNGRDYT VTGGFTGATA ATGKTGARNM PHTRLVAGLG 
ADVE FGNGWN GLARYSYAGS KQYGNHSGRV GVGYRFLEHH HHHH* 
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961C-ORF46. 1 



1 
1 


A i CCCCACAA 


C 1 

D ± 


1 uL 1 1-3 I A^ 


1U1 


pp a t p t a pp a 

CCA1 Ulntun 


1 ^1 


p. c a p, p p p. a t p. 


90, 1 


PPTP, aPT a Z\ P 


0 1 


ppaaaPTaa a 

CCAAAlj 1 AAA 


■501 


ppapapaPTP 


J Ol 


pappaapppp 


41 U 1 


apapTa a pap 

rtbnL 1 AAo AC 


4^1 
1 Jl 


PaTaPPPTPP 
oA 1 Ab,ObJ J. i^bi 


^o 1 


ppaTpa a app 


CC.1 

jjI 


rr , aaapar*ap 
L, CAAACACAC 


bU 1 


r , r , Tvrr'T\c a a a 
bb 1 bLAbAAA 


/TCI 

bol 


1 AL 1 bbAbbb 


/ U 1 


a AfPTT AT"7\ r P 

AAbb i bA 1 A 1 


TCI 

/ Dl 


bbbbAbb ibi. 


o U 1 


1 bb ibi bAAb 


O ~>1 


aaaaa r r/^r , a r n 
AAAAA 1 CCA 1 


yui 


b 1 b 1 bAbAbb 


y oi 


bbl b 1 bbbbl 


1001 


GAT CAbA r r T 


lUol 


LAI 1 IbbAAL 


11U1 


rbbbbAbbbb 


llOl 


1 bbbbAAbb 1 




Al Ibl bbbbl 


lZDl 


CCA1 bbb 1 CA 


UUl 


TTAGCCTTTA 


1 TCI 


bbb I A 1 bAbb 


14U1 


bbA 1 A 1 A 1 AC 


1 i CI 

14 31 


Abb 1 bAbbbA 


lOUl 


AA1 bbbbbl A 


1001 


p a r* pp a hp a 


loUl 


J. bAAbbbbAb 


1 DDI 


Alibi Cbbbb 


1 / Ul 


TGCTGTcATG 


1 / D 1 


bbA i CAACbA 


1801 


7\ m /~i /—i /~< /~» 

GCCATCCGCG 


1851 


AGCCGTCAGC 


1901 


CTGTTCGGGG 


1951 


CGGTCGCAGA 


2001 


CGACAATTTT 


2051 


CCCGAAATAT 


2101 


ACCTCCTCAA 


2151 


CCAACGCCAC 


2201 


ATTTTGAGAA 


2251 


CACTGA 



ACGACGACGA TGTTAAAAAA 
AACAATGGCC AAGAAATCAA 
CAT T GAT G AA GACGGCACAA 
TTGAAGCCGA CGACTTTAAA 
CTGACCAAAA CCGTCAATGA 
AGCTGCAGAA TCTGAAATAG 
ATGCCGCTTT AGCAGATACT 
TTGAATAAAT TGGGAGAAAA 
AAATATCGTA AAAATTGATG 
ACAAGCATGC CGAAGCATTC 
AACACTAAGG CAGACGAAGC 
GGCCGAAGAA ACCAAACAAA 
CTGCAGCAGG CAAAGCCGAA 
GACAAGGCCG AAGCTGTCGC 
CGCTACGAAC AAAGATAATA 
ACACCAGAGA AGAGTCTGAC 
GCTACTACCG AAAAATTGGA 
TGCCGATCAC GATACTCGCC 
TGCGCAAAGA AACCCGCCAA 
CTGTTCCAAC CTTACAACGT 
GGCAAACGAT TCTTTTATCC 
CCGACGGGAA ATACCACCTA 
AGCGGCCATA TCGGATTGGG 
GATGATTCAA CAGGCGGCCA 
TTTCCGATCA CGGGCACGAA 
CATTCCGATT CTGATGAAGC 
CCGCATCCAT TGGGACGGAT 
GGCCACAGGG CGGCGGCTAT 
AGCTACGACA TAAAAGGCGT 
CAACCGCAGC ACCGGACAAC 
GT AT G C T G AC GCAAGGAGTA 
AGCCCCGAGC T GG AC AG AT C 
TGCAGATATC GTTAAAAACA 
CAGGCGATGC CGTGCAGGGC 
CACGGCTTGG GTCTGCTTTC 
TTTGGCAGAT ATGGCGCAAC 
ATTGGGCAGT CCAAAACCCC 
AATATCTTTA TGGCAGCCAT 
AAAATACGGC TTGGGCGGCA 
TGGGCGCGAT CGCATTGCCG 
GCCGATGCGG CATACGCCAA 
CCGTTCAAAC TTGGAGCAGC 
CCGTGCCGCC GTCAAACGGC 
CCGAAGACAG GCGTACCGTT 
GCACGTGAAA TATGATACGC 



GCTGCCACTG TGGCCATTGC 

CGGTTTCAAA GCTGGAGAGA 

TTACCAAAAA AGACGCAACT 

GGTCTGGGTC TGAAAAAAGT 

AAACAAACAA AACGT CGATG 

AAAAGTTAAC AACCAAGTTA 

GATGCCGCTC TGGATGCAAC 

TATAACGACA TTTGCTGAAG 

AAAAAT TAG A AGCCGTGGCT 

AACGATATCG CCGATTCATT 

CGTCAAAACC GCCAATGAAG 

ACGTCGATGC CAAAGTAAAA 

GCTGCCGCTG GCACAGCTAA 

TGCAAAAGTT ACCGACATCA 

TTGCTAAAAA AGCAAACAGT 

AGCAAATTTG TCAGAATTGA 

CACACGCTTG GCTTCTGCTG 

TGAACGGTTT GGATAAAACA 

GGCCTTGCAG AACAAGCCGC 

GGGTGGATCC GGAGGAGGAG 

GGCAGGTTCT CGACCGTCAG 

TTCGGCAGCA GGGGGGAACT 

AAAAAT ACAA AGCCATCAGT 

T T AAAGG AAA TATCGGCTAC 

GTCCATTCCC CCTTCGACAA 

CGGTAGTCCC GTTGACGGAT 

ACGAACACCA TCCCGCCGAC 

CCCGCTCCCA AAGGCGCGAG 

TGCCCAAAAT ATCCGCCTCA 

GGCTTGCCGA CCGTTTCCAC 

GGCGACGGAT TCAAACGCGC 

GGGCAATGCC GCCGAAGCCT 

TCATCGGCGC GGCAGGAGAA 

ATAAGCGAAG GCTCAAACAT 

CACCGAAAAC AAGATGGCGC 

TCAAAGACTA TGCCGCAGCA 

AATGCCGCAC AAGGCATAGA 

CCCCATCAAA GGGATTGGAG 

TCACGGCACA TCCTATCAAG 

AAAGGGAAAT CCGCCGTCAG 

ATACCCGTCC CCTTACCATT 

GTTACGGCAA AG AAAAC AT C 

AAAAATGTCA AACTGGCAGA 

TGACGGTAAA GGGTTTCCGA 

TCGAGCACCA CCACCACCAC 
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1 MATNDDDVKK AATVAIAAAY NNGQEINGFK AGETI YDIDE DGTITKKDAT 

51 AADVEADDFK GLGLKKVVTN LTKTVNENKQ NVDAKVKAAE SEIEKLTTKL 

101 ADTDAALADT DAALDATTNA LNKLGENITT FAEETKTNIV KIDEKLEAVA 

151 DTVDKHAEAF NDIADSLDET NTKADEAVKT ANEAKQTAEE TKQNVDAKVK 

201 AAETAAGKAE AAAGTANTAA DKAEAVAAKV TDIKADIATN KDNIAKKANS 

251 ADVYTREESD SKFVRIDGLN ATTEKLDTRL ASAEKSIADH DTRLNGLDKT 

301 VSDLRKETRQ GLAEQAALSG LFQPYNVGGS GGGGSDLAND SFIRQVLDRQ 

351 HFEPDGKYHL FGSRGELAER SGHIGLGKIQ SHQLGNLMIQ QAAIKGNIGY 

401 IVRFSDHGHE VHSPFDNHAS HSDSDEAGSP VDGFSLYRIH WDGYEHHPAD 

451 GYDGPQGGGY PAPKGARDIY SYDIKGVAQN IRLNLTDNRS TGQRLADRFH 

501 NAGSMLTQGV GDGFKRATRY SPELDRSGNA AEAFNGTADI VKNIIGAAGE 

551 IVGAGDAVQG ISEGSNIAVM HGLGLLSTEN KMARINDLAD MAQLKDYAAA 

601 AIRDWAVQNP NAAQGIEAVS NIFMAAIPIK GIGAVRGKYG LGGITAHPIK 

651 RSQMGAIALP KGKSAVSDNF ADAAYAKYPS PYHSRNIRSN LEQRYGKENI 

701 TSSTVPPSNG KNVKLADQRH PKTGVPFDGK GFPNFEKHVK YDTLEHHHHH 
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20 



25 
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50 



961c-741 

1 

51 

101 

151 

201 

251 

301 

351 

401 

451 

501 

551 

601 

651 

701 

751 

801 

851 

901 

951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 



ATGGCCACAA 
TGCTGCCTAC 
CCATCTACGA 
GCAGCCGATG 
CGTGACTAAC 
C C AAAG T AAA 
GCAGACACTG 
CACCAACGCC 
AGACTAAGAC 
GATACCGTCG 
GGATGAAACC 
CCAAACAGAC 
GCTGCAGAAA 
TACTGCAGCC 
AAGCTGATAT 
GCCGACGTGT 
TGGTCTGAAC 
AAAAAT C CAT 
GTGTCAGACC 
GCTCTCCGGT 
GTGTCGCCGC 
CTCGACCATA 
CAGGAAAAAC 
ATGGAAACGG 
AGCCGTTTCG 
CTTGGAGAGT 
CCGCCTTTCA 
GTTGCGAAAC 
TTTTGACAAG 
TCGGTTCAGA 
GCCAAGCAGG 
TGTCGACCTG 
TCATCAGCGG 
CTCGGTATCT 
GAAAACCGTA 
AGCACCACCA 



ACGACGACGA 
AACAATGGCC 
CATTGATGAA 
TTGAAGCCGA 
CTGACCAAAA 
AGCTGCAGAA 
ATGCCGCTTT 
TTGAATAAAT 
AAATATCGTA 
ACAAGCATGC 
AACACTAAGG 
GGCCGAAGAA 
CTGCAGCAGG 
GACAAGGCCG 
CGCTACGAAC 
ACACCAGAGA 
GCTACTACCG 
TGCCGATCAC 
TGCGCAAAGA 
CTGTTCCAAC 
CGACATCGGT 
AAGACAAAGG 
GAGAAACTGA 
TGACAGCCTC 
ACTTTATCCG 
GGAGAGTTCC 
GACCGAGCAA 
GCCAGTTCAG 
CTTCCCGAAG 
CGATGCCGGC 
GAAACGGCAA 
GCCGCCGCCG 
TTCCGTCCTT 
TTGGCGGAAA 
AACGGCATAC 
CCACCACCAC 



TGTTAAAAAA 
AAGAAATCAA 
GACGGCACAA 
CGACTTTAAA 
CCGTCAATGA 
TCTGAAATAG 
AGCAGATACT 
TGGGAGAAAA 
AAAATTGATG 
CGAAGCATTC 
CAGACGAAGC 
AC C AAAC AAA 
C AAAG C C G AA 
AAGCTGTCGC 
AAAGATAATA 
AGAGTCTGAC 
AAAAATTGGA 
GATACTCGCC 
AACCCGCCAA 
CTTACAACGT 
GCGGGGCTTG 
TTTGCAGTCT 
AGCTGGCGGC 
AATACGGGCA 
CCAAATCGAA 
AAGTATACAA 
ATACAAGATT 
AAT CGGCGAC 
GCGGCAGGGC 
GGAAAACTGA 
AAT CGAACAT 
ATATCAAGCC 
TACAACCAAG 
AG CC C AG G AA 
GCCATATCGG 
TGA 



GCTGCCACTG 
CGGTTTCAAA 
TTACCAAAAA 
GGTCTGGGTC 
AAACAAACAA 
AAAAGTTAAC 
GATGCCGCTC 
TATAACGACA 
AAAAAT TAG A 
AACGATATCG 
CGTCAAAACC 
ACGTCGATGC 
GCTGCCGCTG 
TGCAAAAGTT 
TTGCTAAAAA 
AGCAAATTTG 
CACACGCTTG 
TGAACGGTTT 
GGCCTTGCAG 
GGGTGGATCC 
CCGATGCACT 
TTGACGCTGG 
ACAAGGTGCG 
AATTGAAGAA 
GTGGACGGGC 
ACAAAGCCAT 
CGGAGCATTC 
ATAGCGGGCG 
GACATATCGC 
CCTACACCAT 
TTGAAATCGC 
GGATGGAAAA 
CCGAGAAAGG 
GTTGCCGGCA 
CCTTGCCGCC 



TGGCCATTGC 
GCTGGAGAGA 
AGACGCAACT 
TGAAAAAAGT 
AACGTCGATG 
AACCAAGTTA 
TGGATGCAAC 
TTTGCTGAAG 
AGCCGTGGCT 
CCGATTCATT 
GCCAATGAAG 
CAAAGTAAAA 
GCACAGCTAA 
ACCGACATCA 
AGCAAACAGT 
TCAGAATTGA 
GCTTCTGCTG 
GGATAAAACA 
AACAAGCCGC 
GGAGGGGGTG 
AACCGCACCG 
ATCAGTCCGT 
GAAAAAACTT 
CGACAAGGTC 
AGCTCATTAC 
TCCGCCTTAA 
CGGGAAGATG 
AACATACATC 
GGGACGGCGT 
AGATTTCGCC 
CAGAACTCAA 
CGCCATGCCG 
CAGTTACTCC 
GCGCGGAAGT 
AAGCAACTCG 
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MATNDDDVKK 
AADVEADDFK 
ADTDAALADT 
DTVDKHAEAF 
AAETAAGKAE 
ADVYTREESD 
VSDLRKETRQ 
LDHKDKGLQS 
SRFDFIRQIE 
VAKRQFRIGD 
AKQGNGKIEH 
LGIFGGKAQE 



AATVAIAAAY 
GLGLKBCWTN 
DAALDATTNA 
NDIADSLDET 
AAAGTANTAA 
SKFVRIDGLN 
GLAEQAALSG 
LTLDQSVRKN 
VDGQLITLES 
IAGEHTSFDK 
LKSPELNVDL 
VAGSAEVKTV 



NNGQEINGFK 
LTKTVNENKQ 
LNKLGENITT 
NTKADEAVKT 
DKAEAVAAKV 
ATTEKLDTRL 
LFQPYNVGGS 
EKLKLAAQGA 
GEFQVYKQSH 
LPEGGRATYR 
AAADIKPDGK 
NGIRHIGLAA 



AGETIYDIDE 
NVDAKVKAAE 
FAEETKTNIV 
ANEAKQTAEE 
TDIKADIATN 
ASAEKSIADH 
GGGGVAADIG 
EKTYGNGDSL 
SALTAFQTEQ 
GTAFGSDDAG 
RHAVISGSVL 
KQLEHHHHHH 



DGTITKKDAT 
SEIEKLTTKL 
KIDEKLEAVA 
TKQNVDAKVK 
KDNIAKKANS 
DTRLNGLDKT 
AGLADALTAP 
NTGKLKNDKV 
IQDSEHSGKM 
GKLTYTIDFA 
YNQAEKGSYS 



ATGGCCACAA 
TGCTGCCTAC 
CCATCTACGA 
GCAGCCGATG 
CGTGACTAAC 
CCAAAGTAAA 
GCAGACACTG 
CACCAACGCC 
AGACTAAGAC 
GATACCGTCG 
GGATGAAACC 



ACGACGACGA 
AACAATGGCC 
CATTGATGAA 
TTGAAGCCGA 
CTGACCAAAA 
AGCTGCAGAA 
ATGCCGCTTT 
TTGAATAAAT 
AAATATCGTA 
ACAAGCATGC 
AACACTAAGG 



TGTTAAAAAA 
AAGAAATCAA 
GACGGCACAA 
CGACTTTAAA 
CCGTCAATGA 
TCTGAAATAG 
AG C AG AT ACT 
TGGGAGAAAA 
AAAATTGATG 
CGAAGCATTC 
CAGACGAAGC 



GCTGCCACTG 
CGGTTTCAAA 
TTACCAAAAA 
GGTCTGGGTC 
AAACAAACAA 
AAAAGTTAAC 
GATGCCGCTC 
TATAACGACA 
AAAAATTAGA 
AACGATATCG 
CGTCAAAACC 



TGGCCATTGC 
GCTGGAGAGA 
AGACGCAACT 
TGAAAAAAGT 
AACGTCGATG 
AACCAAGTTA 
TGGATGCAAC 
TTTGCTGAAG 
AGCCGTGGCT 
CCGATTCATT 
GCCAATGAAG 
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25 



551 CCAAACAGAC GGCCGAAGAA AC C AAAC AAA ACGTCGATGC CAAAGTAAAA 

601 GCTGCAGAAA CTGCAGCAGG CAAAGCCGAA GCTGCCGCTG GCACAGCTAA 

651 TACTGCAGCC GACAAGGCCG AAGCTGTCGC TGCAAAAGTT ACCGACATCA 

701 AAGCTGATAT CGCTACGAAC AAAGATAATA TTGCTAAAAA AG CAAAC AG T 

5 7 51 GCCGACGTGT AC AC C AG AG A AGAGTCTGAC AGCAAATTTG TCAGAATTGA 

801 TGGTCTGAAC GCTACTACCG AAAAATTGGA CACACGCTTG GCTTCTGCTG 

851 AAAAATCCAT TGCCGATCAC GATACTCGCC TGAACGGTTT GGATAAAACA 

901 GTGTCAGACC TGCGCAAAGA AACCCGCCAA GGCCTTGCAG AACAAGCCGC 

951 GCTCTCCGGT CTGTTCCAAC CTTACAACGT GGGTGGATCC GGCGGAGGCG 

1001 GCACTTCTGC GCCCGACTTC AATGCAGGCG GTACCGGTAT CGGCAGCAAC 

10 1051 AGCAGAGCAA CAACAGCGAA ATCAGCAGCA GTATCTTACG CCGGTATCAA 

1101 GAACGAAATG TGCAAAGACA GAAGCATGCT CTGTGCCGGT CGGGATGACG 

1151 TTGCGGTTAC AGACAGGGAT GCCAAAATCA ATGCCCCCCC CCCGAATCTG 

1201 CATACCGGAG ACTTTCCAAA CCCAAATGAC GCATACAAGA ATTTGATCAA 

1251 CCTCAAACCT GCAATTGAAG CAGGCTATAC AGGACGCGGG GTAGAGGTAG 

1301 GTATCGTCGA CACAGGCGAA TCCGTCGGCA GCATATCCTT TCCCGAACTG 

15 1351 TATGGCAGAA AAGAACACGG CTATAACGAA AATTACAAAA ACTATACGGC 

14 01 GTATATGCGG AAGGAAGCGC CTGAAGACGG AGGCGGTAAA GACATTGAAG 

14 51 CTTCTTTCGA CGATGAGGCC GTTATAGAGA CTGAAGCAAA GCCGACGGAT 

1501 ATCCGCCACG TAAAAGAAAT CGGACACATC GATTTGGTCT CCCATATTAT 

1551 TGGCGGGCGT TCCGTGGACG GCAGACCTGC AGGCGGTATT GCGCCCGATG 

1601 CGACGCTACA CATAATGAAT ACGAATGATG AAACCAAGAA CGAAATGATG 

20 1651 GTTGCAGCCA TCCGCAATGC ATGGGTCAAG CTGGGCGAAC GTGGCGTGCG 

1701 CATCGTCAAT AACAGTTTTG GAACAACATC GAGGGCAGGC ACTGCCGACC 

1751 TTTTCCAAAT AGCCAATTCG GAGGAGCAGT ACCGCCAAGC GTTGCTCGAC 

1801 TATTCCGGCG GTGATAAAAC AGACGAGGGT ATCCGCCTGA TGCAACAGAG 

1851 CGATTACGGC AACCTGTCCT ACCACATCCG TAATAAAAAC ATGCTTTTCA 

1901 TCTTTTCGAC AGGCAATGAC GCACAAGCTC AGCCCAACAC ATATGCCCTA 

1951 TTGCCATTTT AT GAAAAAG A CGCT CAAAAA GGCATTATCA CAGTCGCAGG 

2001 CGTAGACCGC AGTGGAGAAA AGTTCAAACG GGAAATGTAT GGAGAACCGG 

2051 GTACAGAACC GCTTGAGTAT GGCTCCAACC ATTGCGGAAT TACTGCCATG 

2101 TGGTGCCTGT CGGCACCCTA TGAAGCAAGC GTCCGTTTCA CCCGTACAAA 

2151 CCCGATTCAA ATTGCCGGAA CATCCTTTTC CGCACCCATC GTAACCGGCA 

2201 CGGCGGCTCT GCTGCTGCAG AAATACCCGT GG AT GAG C AA CGACAACCTG 

2251 CGTACCACGT TGCTGACGAC GGCTCAGGAC ATCGGTGCAG TCGGCGTGGA 

2301 CAGCAAGTTC GGCTGGGGAC TGCTGGATGC GGGTAAGGCC ATGAACGGAC 

2351 CCGCGTCCTT TCCGTTCGGC GACTTTACCG CCGATACGAA AGGTACATCC 

24 01 GATATTGCCT ACTCCTTCCG TAACGACATT TCAGGCACGG GCGGCCTGAT 

2 4 51 CAAAAAAGGC GGCAGCCAAC TGCAACTGCA CGGCAACAAC ACCTATACGG 

2501 GCAAAACCAT TATCGAAGGC GGTTCGCTGG TGTTGTACGG CAACAACAAA 

2551 TCGGATATGC GCGTCGAAAC CAAAGGTGCG CT GAT T TATA ACGGGGCGGC 

35 2 601 ATCCGGCGGC AGCCTGAACA GCGACGGCAT TGTCTATCTG GCAGATACCG 

2 651 ACCAATCCGG CGCAAACGAA ACCGTACACA TCAAAGGCAG TCTGCAGCTG 

2701 GACGGCAAAG GTACGCTGTA CACACGTTTG GGCAAACTGC T GAAAGT GG A 

2751 CGGTACGGCG ATTATCGGCG GCAAGCTGTA CATGTCGGCA CGCGGCAAGG 

2801 GGGCAGGCTA TCTCAACAGT ACCGGACGAC GTGTTCCCTT CCTGAGTGCC 

2 851 GCCAAAATCG GGCAGGATTA TTCTTTCTTC ACAAACATCG AAACCGACGG 

40 2 901 CGGCCTGCTG GCTTCCCTCG ACAGCGTCGA AAAAACAGCG GGCAGTGAAG 

2 951 GCGACACGCT GTCCTATTAT GTCCGTCGCG GCAATGCGGC ACGGACTGCT 
3001 TCGGCAGCGG CACATTCCGC GCCCGCCGGT CTGAAACACG CCGTAGAACA 
3051 GGGCGGCAGC AATCTGGAAA ACCTGATGGT CGAACTGGAT GCCTCCGAAT 
3101 CATCCGCAAC ACCCGAGACG GTTGAAACTG CGGCAGCCGA CCGCACAGAT 
3151 ATGCCGGGCA TCCGCCCCTA CGGCGCAACT TTCCGCGCAG CGGCAGCCGT 

45 3201 ACAGCATGCG AATGCCGCCG ACGGTGTACG CATCTTCAAC AGTCTCGCCG 

3251 CTACCGTCTA TGCCGACAGT ACCGCCGCCC ATGCCGATAT GCAGGGACGC 

3301 CGCCTGAAAG CCGTATCGGA CGGGTTGGAC CACAACGGCA CGGGTCTGCG 

3351 CGTCATCGCG CAAACCCAAC AGGACGGTGG AACGTGGGAA CAGGGCGGTG 

3401 TTGAAGGCAA AATGCGCGGC AGT AC CCAAA CCGTCGGCAT TGCCGCGAAA 

3451 ACCGGCGAAA ATACGACAGC AGCCGCCACA CTGGGCATGG GACGCAGCAC 

50 3501 ATGGAGCGAA AACAGTGCAA ATGCAAAAAC CGACAGCATT AGTCTGTTTG 

3551 CAGGCATACG GCACGATGCG GGCGATATCG GCTATCTCAA AGGCCTGTTC 

3 601 TCCTACGGAC GCTACAAAAA CAGCATCAGC CGCAGCACCG GTGCGGACGA 
3 651 ACATGCGGAA GGCAGCGTCA ACGGCACGCT GATGCAGCTG GGCGCACTGG 
3701 GCGGTGTCAA CGTTCCGTTT GCCGCAACGG GAGATTTGAC GGTCGAAGGC 
3751 GGTCTGCGCT ACGACCTGCT CAAACAGGAT GCATTCGCCG AAAAAGGCAG 
3801 TGCTTTGGGC TGGAGCGGCA ACAGCCTCAC TGAAGGCACG CTGGTCGGAC 
3851 TCGCGGGTCT GAAGCTGTCG CAACCCTTGA GCGATAAAGC C&TCCTGTTT 
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GCAACGGCGG GCGTGGAACG CGACCTGAAC GGACGCGACT ACACGGTAAC 
GGGCGGCTTT ACCGGCGCGA CTGCAGCAAC CGGCAAGACG GGGGCACGCA 
ATATGCCGCA CACCCGTCTG GTTGCCGGCC TGGGCGCGGA TGTCGAATTC 
GGCAACGGCT GGAACGGCTT GGCACGTTAC AGCTACGCCG GTTCCAAACA 
GTACGGCAAC CACAGCGGAC GAGTCGGCGT AGGCTACCGG TTCCTCGAGC 
ACCACCACCA CCACCACTGA 



MATNDDDVKK 
AADVEADDFK 
ADTDAALADT 
DTVDKHAEAF 
AAETAAGKAE 
ADVYTREESD 
VSDLRKETRQ 
SRATTAKSAA 
HTGDFPN PND 
YGRKEHGYNE 
IRHVKEIGHI 
VAAIRNAWVK 
YSGGDKTDEG 
LPFYEKDAQK 
WCLSAPYEAS 
RTTLLTTAQD 
DIAYSFRNDI 
SDMRVETKGA 
DGKGTLYTRL 
AKIGQDYSFF 
SAAAHSAPAG 
MPGIRPYGAT 
RLKAVSDGLD 
TGENTTAAAT 
SYGRYKNSIS 
GLRYDLLKQD 
ATAGVERDLN 
GNGWNGLARY 



AAT V A I AAA Y 
GLGLKKWTN 
DAALDATTNA 
NDIADSLDET 
AAAGTANTAA 
SKFVRIDGLN 
GLAEQAALSG 
VSYAGIKNEM 
AYKNLINLKP 
NYKNYTAYMR 
DLVSHIIGGR 
LGERGVRIVN 
IRLMQQSDYG 
GIITVAGVDR 
VRFTRTNPIQ 
IGAVGVDSKF 
SGTGGLIKKG 
LTYNGAASGG 
GKLLKVDGTA 
TNIETDGGLL 
LKHAVEQGGS 
FRAAAAVQHA 
HNGTGLRVIA 
LGMGRSTWSE 
RSTGADEHAE 
AFAEKGSALG 
GRDYTVTGGF 
SYAGSKQYGN 



NNGQEINGFK 
LTKTVNENKQ 
LNKLGENITT 
NTKADEAVKT 
DKAEAVAAKV 
ATTEKLDTRL 
LFQPYNVGGS 
CKDRSMLCAG 
AIEAGYTGRG 
KEAPEDGGGK 
SVDGRPAGGI 
NSFGTTSRAG 
NLSYHIRNKN 
SGEKFKREMY 
IAGTSFSAPI 
GWGLLDAGKA 
GSQLQLHGNN 
SLNSDGIVYL 
IIGGKLYMSA 
ASLDSVEKTA 
NLENLMVELD 
NAADGVRIFN 
QTQQDGGTWE 
NSANAKTDSI 
GSVNGTLMQL 
WSGNSLTEGT 
TGATAATGKT 
HSGRVGVGYR 



AGETIYDIDE 
N V D AKVKAAE 
FAEETKTNIV 
ANEAKQTAEE 
TDIKADIATN 
ASAEKSIADH 
GGGGTSAPDF 
RDDVAVTDRD 
VEVGIVDTGE 
DIEASFDDEA 
APDATLHIMN 
TADLFQIANS 
MLFIFSTGND 
GEPGTEPLEY 
VTGTAALLLQ 
MNGPASFPFG 
TYTGKTIIEG 
ADTDQSGANE 
RGKGAGYLNS 
GSEGDTLSYY 
ASESSATPET 
SLAATVYADS 
QGGVEGKMRG 
SLFAGIRHDA 
GALGGVNVPF 
LVGLAGLKLS 
GARNMPHTRL 
FLEHHHHHH* 



DGTITKKDAT 
SEIEKLTTKL 
KIDEKLEAVA 
TKQNVDAKVK 
KDNIAKKANS 
DTRLNGLDKT 
NAGGTGIGSN 
AKINAPPPNL 
SVGSISFPEL 
VIETEAKPTD 
TNDETKNEMM 
EEQYRQALLD 
AQAQPNTYAL 
GSNHCGITAM 
KYPWMSNDNL 
DFTADTKGTS 
GSLVLYGNNK 
TVHIKGSLQL 
TGRRVPFLSA 
VRRGNAARTA 
VETAAADRTD 
TAAHADMQGR 
STQTVGIAAK 
GDIGYLKGLF 
AATGDLTVEG 
QPLSDKAVLF 
VAGLGADVEF 
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961CL-ORF4 6.1 



10 



15 



20 



25 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 



ATGAAACACT 
CTGTAGCGGC 
CCACTGTGGC 
TTCAAAGCTG 
CAAAAAAGAC 
TGGGTCTGAA 
AAACAAAACG 
GTTAACAACC 
CCGCTCTGGA 
ACGACATTTG 
ATTAGAAGCC 
ATATCGCCGA 
AAAACCGCCA 
CGATGCCAAA 
CCGCTGGCAC 
AAAGTTACCG 
TAAAAAAGCA 
AATTTGTCAG 
CGCTTGGCTT 
CGGTTTGGAT 
TTGCAGAACA 
GGATCCGGAG 
GGTTCTCGAC 
GCAGCAGGGG 
ATACAAAGCC 
AGGAAATATC 
ATTCCCCCTT 
AGTCCCGTTG 
ACACCATCCC 



TTCCATCCAA 
GCACTGGCAG 
CATTGCTGCT 
GAGAGACCAT 
GCAACTGCAG 
AAAAGTCGTG 
TCGATGCCAA 
AAGTTAGCAG 
TGCAACCACC 
CTGAAGAGAC 
GTGGCTGATA 
TTCATTGGAT 
ATGAAGCCAA 
GTAAAAGCTG 
AGCTAATACT 
AC AT CAAAG C 
AACAGTGCCG 
AATTGATGGT 
CTGCTGAAAA 
AAAACAGTGT 
AGCCGCGCTC 
GAGGAGGATC 
CGTCAGCATT 
GGAACTTGCC 
ATCAGTTGGG 
GGCTACATTG 
CGACAACCAT 
ACGGATTTAG 
GCCGACGGCT 



AG T AC T G AC C 
CCACAAACGA 
GCCTACAACA 
CTACGACATT 
CCGATGTTGA 
ACTAACCTGA 
AGTAAAAGCT 
ACACTGATGC 
AACGCCTTGA 
TAAGACAAAT 
CCGTCGACAA 
GAAACCAACA 
ACAGACGGCC 
CAGAAACTGC 
GCAGCCGACA 
TGATATCGCT 
ACGTGTACAC 
CTGAACGCTA 
ATCCATTGCC 
CAGACCTGCG 
TCCGGTCTGT 
AGATTTGGCA 
TCGAACCCGA 
GAGCGCAGCG 
CAACCTGATG 
TCCGCTTTTC 
GCCTCACATT 
CCTTTACCGC 
ATGACGGGCC 



ACAGCCATCC 
CGACGATGTT 
ATGGCCAAGA 
GATGAAGACG 
AGCCGACGAC 
CCAAAACCGT 
GCAGAATCTG 
CGCTTTAGCA 
ATAAATTGGG 
AT CGT AAAAA 
GCATGCCGAA 
CTAAGGCAGA 
GAAGAAACCA 
AGCAGGCAAA 
AGGCCGAAGC 
ACGAACAAAG 
CAGAGAAGAG 
CTACCGAAAA 
GATCACGATA 
CAAAGAAACC 
TCCAACCTTA 
AACGATTCTT 
CGGGAAATAC 
GCCATATCGG 
ATTCAACAGG 
CGATCACGGG 
CCGATTCTGA 
ATCCATTGGG 
ACAGGGCGGC 



TTGCCACTTT 
AAAAAAGCTG 
AATCAACGGT 
GCACAATTAC 
TTTAAAGGTC 
CAATGAAAAC 
AAATAGAAAA 
GATACTGATG 
AGAAAATATA 
TTGATGAAAA 
GCATTCAACG 
CGAAGCCGTC 
AACAAAACGT 
GCCGAAGCTG 
TGTCGCTGCA 
ATAATATTGC 
TCTGACAGCA 
ATTGGACACA 
CTCGCCTGAA 
CGCCAAGGCC 
CAACGTGGGT 
TTATCCGGCA 
CACCTATTCG 
ATTGGGAAAA 
CGGCCATTAA 
CACGAAGTCC 
TGAAGCCGGT 
ACGGATACGA 
GGCTATCCCG 



30 



35 



40 



45 



1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 
2101 
2151 
2201 
2251 
2301 



CTCCCAAAGG 
CAAAATATCC 
TGCCGACCGT 
ACGGATTCAA 
AATGCCGCCG 
CGGCGCGGCA 
GCGAAGGCTC 
GAAAACAAGA 
AGACTATGCC 
CCGCACAAGG 
AT CAAAG G G A 
GGCACATCCT 
GGAAATCCGC 
CCGTCCCCTT 
CGGCAAAGAA 
ATGTCAAACT 
GGTAAAGGGT 
CGAG 



CGCGAGGGAT 
GCCTCAACCT 
TTCCACAATG 
ACGCGCCACC 
AAGCCTTCAA 
GGAGAAATTG 
AAACATTGCT 
TGGCGCGCAT 
GCAGCAGCCA 
CATAGAAGCC 
TTGGAGCTGT 
ATCAAGCGGT 
CGTCAGCGAC 
ACCATTCCCG 
AACATCACCT 
GGCAGACCAA 
TTCCGAATTT 



ATATACAGCT 
GACCGACAAC 
CCGGTAGTAT 
CGATACAGCC 
CGGCACTGCA 
TCGGCGCAGG 
GTCATGCACG 
CAACGATTTG 
TCCGCGATTG 
GTCAGCAATA 
TCGGGGAAAA 
CGCAGATGGG 
AATTTTGCCG 
AAATATCCGT 
CCTCAACCGT 
CGCCACCCGA 
TGAGAAGCAC 



ACGACATAAA 
CGCAGCACCG 
GCTGACGCAA 
CCGAGCTGGA 
GATATCGTTA 
CGATGCCGTG 
GCTTGGGTCT 
GC AG AT AT GG 
GGCAGTCCAA 
TCTTTATGGC 
TACGGCTTGG 
CGCGATCGCA 
ATGCGGCATA 
TCAAACTTGG 
GCCGCCGTCA 
AGACAGGCGT 
GTGAAATATG 



AGGCGTTGCC 
GACAACGGCT 
GGAGTAGGCG 
CAGATCGGGC 
AAAACATCAT 
CAGGGCATAA 
GCTTTCCACC 
CGCAACTCAA 
AACCCCAATG 
AGCCATCCCC 
GCGGCATCAC 
TTGCCGAAAG 
CGCCAAATAC 
AGCAGCGTTA 
AACGGCAAAA 
ACCGTTTGAC 
ATACGTAACT 
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1 MKHFPSKVLT TAILAT FCSG ALAATNDDDV KKAATVAIAA AYNNGQEING 

51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKW TNLTKTVNEN 

101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 

5 151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE AFNDIADSLD ETNTKADEAV 

201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 

251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 

301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQPYNVG 

351 GSGGGGSDLA NDSFIRQVLD RQHFEPDGKY HLFGSRGELA ERSGHIGLGK 

401 IQSHQLGNLM IQQAAIKGNI GYIVRFSDHG HEVHSPFDNH ASHSDSDEAG 

10 4 51 SPVDGFSLYR IHWDGYEHHP ADGYDGPQGG GYPAPKGARD I YSYDIKGVA 

501 QNIRLNLTDN RSTGQRLADR FHNAGSMLTQ GVGDGFKRAT RYSPELDRSG 

551 NAAEAFNGTA DIVKNI IGAA GEIVGAGDAV QGISEGSNIA VMHGLGLLST 

601 ENKMARINDL ADMAQLKDYA AAAIRDWAVQ NPNAAQGIEA VSNIFMAAIP 

651 IKGIGAVRGK YGLGGITAHP IKRSQMGAIA LPKGKSAVSD NFADAAYAKY 

701 PSPYHSRNIR SNLEQRYGKE NITSSTVPPS NGKNVKLADQ RHPKTGVPFD 
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30 



35 



40 



45 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 



ATGAAACACT 
CTGTAGCGGC 
CCACTGTGGC 
TTCAAAGCTG 
CAAAAAAGAC 
TGGGTCTGAA 
AAACAAAACG 
GTTAACAACC 
CCGCTCTGGA 
ACGACATTTG 
ATTAGAAGCC 
ATATCGCCGA 
AAAACCGCCA 
CGATGCCAAA 
CCGCTGGCAC 
AAAGTTACCG 
TAAAAAAGCA 
AATTTGTCAG 
CGCTTGGCTT 
CGGTTTGGAT 
TTGCAGAACA 
GGATCCGGAG 
TGCACTAACC 
CGCTGGATCA 
GGTGCGGAAA 
GAAGAACGAC 
ACGGGCAGCT 
AGCCATTCCG 
GCATTCCGGG 



TTCCATCCAA 
GCACTGGCAG 
CATTGCTGCT 
GAG AG AC CAT 
GCAACTGCAG 
AAAAGTCGTG 
TCGATGCCAA 
AAGTTAGCAG 
TGCAACCACC 
CTGAAGAGAC 
GTGGCTGATA 
TTCATTGGAT 
ATGAAGCCAA 
GTAAAAGCTG 
AGCTAATACT 
AC AT CAAAGC 
AACAGTGCCG 
AATTGATGGT 
CTGCTGAAAA 
AAAACAGTGT 
AGCCGCGCTC 
GGGGTGGTGT 
GCACCGCTCG 
GTCCGTCAGG 
AAACTTATGG 
AAGGTCAGCC 
CATTACCTTG 
CCTTAACCGC 
AAGATGGTTG 



AGTACTGACC 
CCACAAACGA 
GCCTACAACA 
CTACGACATT 
CCGATGTTGA 
ACTAACCTGA 
AGTAAAAGCT 
AC ACT GAT G C 
AACGCCTTGA 
TAAGACAAAT 
CCGTCGACAA 
GAAACCAACA 
ACAGACGGCC 
CAGAAACTGC 
GCAGCCGACA 
TGATATCGCT 
ACGTGTACAC 
CTGAACGCTA 
ATCCATTGCC 
CAGACCTGCG 
TCCGGTCTGT 
CGCCGCCGAC 
ACCATAAAGA 
AAAAACGAGA 
AAACGGTGAC 
GTTTCGACTT 
GAGAGTGGAG 
CTTTCAGACC 
CGAAACGCCA 



ACAGCCATCC 
CGACGATGTT 
ATGGCCAAGA 
GATGAAGACG 
AGCCGACGAC 
CCAAAACCGT 
GCAGAATCTG 
CGCTTTAGCA 
ATAAATTGGG 
ATCGTAAAAA 
GCATGCCGAA 
CTAAGGCAGA 
GAAGAAACCA 
AG C AG G C AAA 
AGGCCGAAGC 
ACGAACAAAG 
CAGAGAAGAG 
CTACCGAAAA 
GATCACGATA 
CAAAGAAACC 
TCCAACCTTA 
ATCGGTGCGG 
CAAAGGTTTG 
AACTGAAGCT 
AGCCTCAATA 
TATCCGCCAA 
AGTTCCAAGT 
GAG C AAAT AC 
GTTCAGAATC 



TTGCCACTTT 
AAAAAAGCTG 
AATCAACGGT 
GCACAATTAC 
TTTAAAGGTC 
CAATGAAAAC 
AAATAGAAAA 
GATACTGATG 
AGAAAATATA 
TTGATGAAAA 
GCATTCAACG 
CGAAGCCGTC 
AACAAAACGT 
GCCGAAGCTG 
TGTCGCTGCA 
ATAATATTGC 
TCTGACAGCA 
ATTGGACACA 
CTCGCCTGAA 
CGCCAAGGCC 
CAACGTGGGT 
GGCTTGCCGA 
CAGTCTTTGA 
GGCGGCACAA 
CGGGCAAATT 
ATCGAAGTGG 
ATACAAACAA 
AAGATTCGGA 
GGCGACATAG 



14 51 CGGGCGAACA TACATCTTTT GACAAGCTTC CCGAAGGCGG CAGGGCGACA 

1501 TATCGCGGGA CGGCGTTCGG TTCAGACGAT GCCGGCGGAA AACTGACCTA 

1551 CACCATAGAT TTCGCCGCCA AGCAGGGAAA CGGCAAAATC GAACATTTGA 

1601 AATCGCCAGA ACTCAATGTC GACCTGGCCG CCGCCGATAT CAAGCCGGAT 

1651 GGAAAACGCC ATGCCGTCAT CAGCGGTTCC GTCCTTTACA ACCAAGCCGA 

1701 GAAAGGCAGT TACTCCCTCG GTATCTTTGG CGGAAAAGCC CAGGAAGTTG 

1751 CCGGCAGCGC GGAAGTGAAA ACCGTAAACG GCATACGCCA TATCGGCCTT 

1801 GCCGCCAAGC AACTCGAGCA CCACCACCAC CACCACTGA 
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10 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 



MKHFPSKVLT 
FKAGETIYDI 
KQNVDAKVKA 
TTFAEETKTN 
KTANEAKQTA 
KVTDIKADIA 
RLASAEKSIA 
GSGGGGVAAD 
GAEKTYGNGD 
SHSALTAFQT 
YRGTAFGSDD 
GKRHAVISGS 
AAKQLEHHHH 



TAILAT FCSG 
DEDGTITKKD 
AESEIEKLTT 
IVKIDEKLEA 
EETKQNVDAK 
TNKDNIAKKA 
DHDTRLNGLD 
IGAGLADALT 
SLNTGKLKND 
EQIQDSEHSG 
AGGKLTYTID 
VLYNQAEKGS 
HH* 



ALAATNDDDV 
ATAADVEADD 
KLADTDAALA 
VADTVDKHAE 
VKAAETAAGK 
NSADVYTREE 
KTVSDLRKET 
APLDHKDKGL 
KVSRFDFIRQ 
KMVAKRQFRI 
FAAKQGNGKI 
YSLGIFGGKA 



KKAATVAIAA 
FKGLGLKKW 
DTDAALDATT 
AFNDIADSLD 
AEAAAGTANT 
SDSKFVRIDG 
RQGLAEQAAL 
QSLTLDQSVR 
IEVDGQLITL 
GDIAGEHTSF 
EHLKSPELNV 
QEVAGSAEVK 



AYNNGQEING 
TNLTKTVNEN 
NALNKLGENI 
ETNTKADEAV 
AADKAEAVAA 
LNATTEKLDT 
SGLFQPYNVG 
KNEKLKLAAQ 
ESGEFQVYKQ 
DKLPEGGRAT 
DLAAADIKPD 
TVNGIRHIGL 



15 



961cL-983 



20 



25 



30 



35 



40 



45 



50 



1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 
851 
901 
951 
1001 
1051 
1101 
1151 
1201 
1251 
1301 
1351 
1401 
1451 
1501 
1551 
1601 
1651 
1701 
1751 
1801 
1851 
1901 
1951 
2001 
2051 



AT G AAAC AC T 
CTGTAGCGGC 
CCACTGTGGC 
TTCAAAGCTG 
CAAAAAAGAC 
TGGGTCTGAA 
AAACAAAACG 
GTTAACAACC 
CCGCTCTGGA 
ACGACATTTG 
ATTAGAAGCC 
ATATCGCCGA 
AAAACCGCCA 
CGATGCCAAA 
CCGCTGGCAC 
AAAGTTACCG 
TAAAAAAGCA 
AATTTGTCAG 
CGCTTGGCTT 
CGGTTTGGAT 
TTGCAGAACA 
GGATCCGGCG 
CGGTATCGGC 
CTTACGCCGG 
GCCGGTCGGG 
CCCCCCCCCG 
ACAAGAATTT 
CGCGGGGTAG 
ATCCTTTCCC 
ACAAAAACTA 
GGTAAAGACA 
AGCAAAGCCG 
TGGTCTCCCA 
GGTATTGCGC 
CAAGAACGAA 
GCGAACGTGG 
GCAGGCACTG 
CCAAGCGTTG 
GCCTGATGCA 
AAAAAC AT G C 
CAACACATAT 
TTATCACAGT 



TTCCATCCAA 
GCACTGGCAG 
CATTGCTGCT 
GAGAGACCAT 
GCAACTGCAG 
AAAAGTCGTG 
TCGATGCCAA 
AAG T TAG C AG 
TGCAACCACC 
C T G AAG AG AC 
GTGGCTGATA 
TTCATTGGAT 
AT G AAG CC AA 
GTAAAAGCTG 
AGCTAATACT 
ACATCAAAGC 
AACAGTGCCG 
AATTGATGGT 
CTGCTGAAAA 
AAAACAGTGT 
AGCCGCGCTC 
GAGGCGGCAC 
AGCAACAGCA 
TATCAAGAAC 
ATGACGTTGC 
AATCTGCATA 
GATCAACCTC 
AGGTAGGTAT 
GAACTGTATG 
TACGGCGTAT 
TTGAAGCTTC 
ACGGATATCC 
TATTATTGGC 
CCGATGCGAC 
ATGATGGTTG 
CGTGCGCATC 
CCGACCTTTT 
CTCGACTATT 
AC AG AG CG AT 
TTTTCATCTT 
GCCCTATTGC 
CGCAGGCGTA 



AGTACTGACC 
CCACAAACGA 
GCCTACAACA 
CTACGACATT 
CCGATGTTGA 
ACTAACCTGA 
AGTAAAAGCT 
ACACTGATGC 
AACGCCTTGA 
TAAGACAAAT 
CCGTCGACAA 
G AAAC C AAC A 
ACAGACGGCC 
CAGAAACTGC 
GCAGCCGACA 
TGATATCGCT 
ACGTGTACAC 
CTGAACGCTA 
ATCCATTGCC 
CAGACCTGCG 
TCCGGTCTGT 
TTCTGCGCCC 
GAGCAACAAC 
GAAATGTGCA 
GGTTACAGAC 
CCGGAGACTT 
AAACCTGCAA 
CGTCGACACA 
GCAGAAAAGA 
ATGCGGAAGG 
TTTCGACGAT 
GCCACGTAAA 
GGGCGTTCCG 
G C T AC AC AT A 
CAGCCATCCG 
GTCAATAACA 
CCAAATAGCC 
CCGGCGGTGA 
TACGGCAACC 
TTCGACAGGC 
CATTTTATGA 
GACCGCAGTG 



ACAGCCATCC 
CGACGATGTT 
ATGGCCAAGA 
GAT G AAG AC G 
AGCCGACGAC 
CCAAAACCGT 
GCAGAATCTG 
CGCTTTAGCA 
ATAAATTGGG 
ATCGTAAAAA 
GCATGCCGAA 
CTAAGGCAGA 
G AAG AAAC C A 
AGCAGGCAAA 
AGGCCGAAGC 
ACGAACAAAG 
CAGAGAAGAG 
CTACCGAAAA 
GATCACGATA 
CAAAGAAACC 
TCCAACCTTA 
GACTTCAATG 
AGCGAAATCA 
AAGACAGAAG 
AGGGATGCCA 
TCCAAACCCA 
TTGAAGCAGG 
GGCGAATCCG 
ACACGGCTAT 
AAGCGCCTGA 
GAGGCCGTTA 
AGAAATCGGA 
TGGACGGCAG 
AT G AAT AC G A 
CAATGCATGG 
GTTTTGGAAC 
AATTCGGAGG 
TAAAACAGAC 
TGTCCTACCA 
AAT G AC G C AC 
AAAAGACGCT 
GAGAAAAGTT 



TTGCCACTTT 
AAAAAAGCTG 
AATCAACGGT 
GCACAATTAC 
TTTAAAGGTC 
CAATGAAAAC 
AAATAGAAAA 
GATACTGATG 
AGAAAATATA 
TTGATGAAAA 
G CAT T C AAC G 
CGAAGCCGTC 
AACAAAACGT 
GCCGAAGCTG 
TGTCGCTGCA 
ATAATATTGC 
TCTGACAGCA 
ATT GG AC AC A 
CTCGCCTGAA 
CGCCAAGGCC 
CAACGTGGGT 
CAGGCGGTAC 
GCAGCAGTAT 
CATGCTCTGT 
AAATCAATGC 
AATGACGCAT 
CTATACAGGA 
TCGGCAGCAT 
AACGAAAATT 
AGACGGAGGC 
TAGAGACTGA 
CACATCGATT 
ACCTGCAGGC 
ATGATGAAAC 
GTCAAGCTGG 
AACATCGAGG 
AGCAGTACCG 
GAGGGTATCC 
CATCCGTAAT 
AAGCTCAGCC 
CAAAAAGGCA 
CAAACGGGAA 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



2101 
2151 
2201 
2251 
2301 
2351 
2401 
2451 
2501 
2551 
2601 
2651 
2701 
2751 
2801 
2851 
2901 
2951 
3001 
3051 
3101 
3151 
3201 
3251 
3301 
3351 
3401 
3451 
3501 
3551 
3601 
3651 
3701 
3751 
3801 
3851 
3901 
3951 
4001 
4051 
4101 
4151 
4201 



ATGTATGGAG 
CGGAATTACT 
GTTTCACCCG 
CCCATCGTAA 
GAGCAACGAC 
GTGCAGTCGG 
AAGGCCATGA 
TACGAAAGGT 
GCACGGGCGG 
AACAACACCT 
GTACGGCAAC 
TTTATAACGG 
TATCTGGCAG 
AGGCAGTCTG 
AACTGCTGAA 
TCGGCACGCG 
TCCCTTCCTG 
ACATCGAAAC 
ACAGCGGGCA 
TGCGGCACGG 
AACACGCCGT 
CTGGATGCCT 
AGCCGACCGC 
GCGCAGCGGC 
TTCAACAGTC 
CGATATGCAG 
ACGGCACGGG 
TGGGAACAGG 
CGGCATTGCC 
GCATGGGACG 
AGCATTAGTC 
TCTCAAAGGC 
GCACCGGTGC 
CAGCTGGGCG 
TTTGACGGTC 
TCGCCGAAAA 
GGCACGCTGG 
TAAAGCCGTC 
GCGACTACAC 
AAGACGGGGG 
CGCGGATGTC 
ACGCCGGTTC 
TACCGGTTCT 



AACCGGGTAC 
GCCATGTGGT 
TACAAACCCG 
CCGGCACGGC 
AACCTGCGTA 
CGTGGACAGC 
ACGGACCCGC 
ACATCCGATA 
CCTGATCAAA 
ATACGGGCAA 
AACAAATCGG 
GGCGGCATCC 
ATACCGACCA 
CAGCTGGACG 
AGTGGACGGT 
GCAAGGGGGC 
AGTGCCGCCA 
CGACGGCGGC 
GTGAAGGCGA 
ACTGCTTCGG 
AGAACAGGGC 
CCGAATCATC 
AC AG AT AT GC 
AGCCGTACAG 
TCGCCGCTAC 
GGACGCCGCC 
TCTGCGCGTC 
GCGGTGTTGA 
GCGAAAACCG 
CAGCACATGG 
TGTTTGCAGG 
CTGTTCTCCT 
GGACGAACAT 
CACTGGGCGG 
GAAGGCGGTC 
AGGCAGTGCT 
TCGGACTCGC 
CTGTTTGCAA 
GGTAACGGGC 
CACGCAATAT 
GAATTCGGCA 
CAAACAGTAC 
GACTCGAG 



AGAACCGCTT 
GCCTGTCGGC 
ATTCAAATTG 
GGCTCTGCTG 
CCACGTTGCT 
AAGTTCGGCT 
GTCCTTTCCG 
TTGCCTACTC 
AAAGGCGGCA 
AAC CAT TAT C 
ATATGCGCGT 
GGCGGCAGCC 
ATCCGGCGCA 
GCAAAGGTAC 
ACGGCGATTA 
AGGCTATCTC 
AAATCGGGCA 
CTGCTGGCTT 
CACGCTGTCC 
CAGCGGCACA 
GGC AGCAATC 
CGCAACACCC 
CGGGCATCCG 
CATGCGAATG 
CGTCTATGCC 
TGAAAGCCGT 
ATCGCGCAAA 
AGGCAAAATG 
GCGAAAATAC 
AG C G AAAAC A 
CATACGGCAC 
ACGGACGCTA 
GCGGAAGGCA 
TGTCAACGTT 
TGCGCTACGA 
TTGGGCTGGA 
GGGTCTGAAG 
CGGCGGGCGT 
GGCTTTACCG 
GCCGCACACC 
ACGGCTGGAA 
GGCAACCACA 



GAGTATGGCT 
ACCCTATGAA 
CCGGAACATC 
CTGCAGAAAT 
GACGACGGCT 
GGGGACTGCT 
TTCGGCGACT 
CTTCCGTAAC 
GCCAACTGCA 
GAAGGCGGTT 
CGAAACCAAA 
TGAACAGCGA 
AACGAAACCG 
GCTGTACACA 
TCGGCGGCAA 
AACAGTACCG 
GGATTATTCT 
CCCTCGACAG 
TATTATGTCC 
TTCCGCGCCC 
TGGAAAACCT 
GAGACGGTTG 
CCCCTACGGC 
CCGCCGACGG 
GACAGTACCG 
ATCGGACGGG 
CCCAACAGGA 
CGCGGCAGTA 
G AC AG C AG C C 
GTGCAAATGC 
GATGCGGGCG 
CAAAAACAGC 
GCGTCAACGG 
CCGTTTGCCG 
CCTGCTCAAA 
GCGGCAACAG 
CTGTCGCAAC 
GGAACGCGAC 
GCGCGACTGC 
CGTCTGGTTG 
CGGCTTGGCA 
GCGGACGAGT 



CCAACCATTG 
GCAAGCGTCC 
CTTTTCCGCA 
ACCCGTGGAT 
CAGGACATCG 
GGATGCGGGT 
TTACCGCCGA 
GACATTTCAG 
ACTGCACGGC 
CGCTGGTGTT 
GGTGCGCTGA 
CGGCATTGTC 
TACACATCAA 
CGTTTGGGCA 
GCTGTACATG 
GACGACGTGT 
TTCTTCACAA 
CGTCGAAAAA 
GTCGCGGCAA 
GCCGGTCTGA 
GATGGTCGAA 
AAACTGCGGC 
GCAACTTTCC 
TGTACGCATC 
CCGCCCATGC 
TTGGACCACA 
CGGTGGAACG 
CCCAAACCGT 
GCCACACTGG 
AAAAACCGAC 
ATATCGGCTA 
ATCAGCCGCA 
CACGCTGATG 
CAACGGGAGA 
CAGGATGCAT 
CCTCACTGAA 
CCTTGAGCGA 
CTGAACGGAC 
AGCAACCGGC 
CCGGCCTGGG 
CGTTACAGCT 
CGGCGTAGGC 
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1 MKHFPSKVLT TAILATFCSG ALAATNDDDV KKAATVAIAA AYNNGQEING 

51 FKAGETIYDI DEDGTITKKD ATAADVEADD FKGLGLKKVV TNLTKTVNEN 

101 KQNVDAKVKA AESEIEKLTT KLADTDAALA DTDAALDATT NALNKLGENI 

151 TTFAEETKTN IVKIDEKLEA VADTVDKHAE AFNDIADSLD ETNTKADEAV 

5 201 KTANEAKQTA EETKQNVDAK VKAAETAAGK AEAAAGTANT AADKAEAVAA 

251 KVTDIKADIA TNKDNIAKKA NSADVYTREE SDSKFVRIDG LNATTEKLDT 

301 RLASAEKSIA DHDTRLNGLD KTVSDLRKET RQGLAEQAAL SGLFQPYNVG 

351 GSGGGGTSAP DFNAGGTGIG SNSRATTAKS AAV S Y AG I KN EMCKDRSMLC 

401 AGRDDVAVTD RDAKINAPPP NLHTGDFPNP NDAYKNLINL KPAIEAGYTG 

10 451 RGVEVGIVDT GESVGSISFP ELYGRKEHGY NENYKNYTAY MRKEAPEDGG 

501 GKDIEASFDD EAVIETEAKP TDIRHVKEIG HIDLVSHIIG GRSVDGRPAG 

551 GIAPDATLHI MNTNDETKNE MMVAAIRNAW VKLGERGVRI VNNSFGTTSR 

601 AGTADLFQIA NSEEQYRQAL LDYSGGDKTD EGIRLMQQSD YGNLSYHIRN 

651 KNMLFIFSTG NDAQAQPNTY ALLPFYEKDA QKGIITVAGV DRSGEKFKRE 

701 MYGEPGTEPL EYGSNHCGIT AMWCLSAPYE ASVRFTRTNP IQIAGTSFSA 

15 751 PIVTGTAALL LQKYPWMSND NLRTTLLTTA QDIGAVGVDS KFGWGLLDAG 

801 KAMNGPASFP FGDFTADTKG TSDIAYSFRN DISGTGGLIK KGGSQLQLHG 

851 NNTYTGKTII EGGSLVLYGN NKSDMRVETK GALIYNGAAS GGSLNSDGIV 

901 YLADTDQSGA NETVHIKGSL QLDGKGTLYT RLGKLLKVDG TAI IGGKLYM 

951 SARGKGAGYL NSTGRRVPFL SAAKIGQDYS FFTNIETDGG LLASLDSVEK 

1001 TAGSEGDTLS YYVRRGNAAR TASAAAHSAP AGLKHAVEQG GSNLENLMVE 

20 1051 LDASESSATP ETVETAAADR TDMPGIRPYG AT FRAAAAVQ HAN AADGVR I 

1101 FNSLAATVYA DSTAAHADMQ GRRLKAVSDG LDHNGTGLRV IAQTQQDGGT 



1151 WEQGGVEGKM 

1201 SISLFAGIRH 

1251 QLGALGGVNV 

1301 GTLVGLAGLK 

1351 KTGARNMPHT 

1401 YRF* 



RGSTQTVGIA AKTGENTTAA 
DAGDIGYLKG LFSYGRYKNS 
PFAATGDLTV EGGLRYDLLK 
LSQPLSDKAV LFATAGVERD 
RLVAGLGADV EFGNGWNGLA 



ATLGMGRSTW SENSANAKTD 
ISRSTGADEH AEGSVNGTLM 
QDAFAEKGSA LGWSGNSLTE 
LNGRDYTVTG GFTGATAATG 
RYSYAGSKQY GNHSGRVGVG 



[0279] It will be understood that the invention has been described by way of example only and modifications may be 
made whilst remaining within the scope and spirit of the invention. For instance, the use of proteins from other strains 
is envisaged [e.g. see WO00/66741 for polymorphic sequences for ORF4, ORF40, ORF46, 225, 235, 287, 519, 726, 
35 91 9 and 953]. 

EXPERIMENTAL DETAILS 

FPLC protein purification 

40 

[0280] The following table summarises the FPLC protein purification that was used: 



Protein 


PI 


Column 


Buffer 


PH 


Protocol 


-| 21 J untagged 


6.23 


Mono Q 


Tris 


8.0 


A 


1 28.1 untagged 


5.04 


Mono Q 


Bis-Tris propane 


6.5 


A 


406.1 L 


7.75 


Mono Q 


Diethanolamine 


9.0 


B 


576.1 L 


5.63 


Mono Q 


Tris 


7.5 


B 


593untagged 


8.79 


Mono S 


Hepes 


7.4 


A 


72Quntagged 


4.95 


Hi-trap S 


Bis-Tris 


6.0 


A 


g-| guntagged 


10.5(-leader) 


Mono S 


Bicine 


8.5 


C 


919Lorf4 


10.4(-leader) 


Mono S 


Tris 


8.0 


B 


920L 


6.92(-leader) 


Mono Q 


Diethanolamine 


8.5 


A 


953L 


7.56(-leader) 


Mono S 


MES 


6.6 


D 
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(continued) 



Protein 


PI 


Column 


Buffer 


PH 


Protocol 


982untagged 


4.73 


Mono Q 


Bis-Tris propane 


6.5 


A 


919-287 


6.58 


Hi-trap Q 


Tris 


8.0 


A 


953-287 


4.92 


Mono Q 


Bis-Tris propane 


6.2 


A 



[0281] Buffer solutions included 20-120 mM NaCI, 5.0 mg/ml CHAPS and 10% v/v glycerol. The dialysate was cen- 
trifuged at 13000g for 20 min and applied to either a mono Q or mono S FPLC ion-exchange resin. Buffer and ion 
exchange resins were chosen according to the pi of the protein of interest and the recommendations of the FPLC protocol 
manual [Pharmacia: FPLC Ion Exchange and Chromatofocussing; Principles and Methods. Pharmacia Publication]. 
Proteins were eluted using a step-wise NaCI gradient. Purification was analysed by SDS-PAGE and protein concentration 
determined by the Bradford method. 

[0282] The letter in the 'protocol' column refers to the following: 

[0283] FPLC-A: Clones 121 .1 , 128.1, 593, 726, 982, periplasmic protein 920L and hybrid proteins 919-287, 953-287 
were purified from the soluble fraction of E.coli obtained after disruption of the cells. Single colonies harbouring the 
plasmid of interest were grown overnight at 37°C in 20 ml of LB/Amp (100 jjug/ml) liquid culture. Bacteria were diluted 
1 :30 in 1 .0 L of fresh medium and grown at either 30°C or 37°C until the OD 550 reached 0.6-08. Expression of recombinant 
protein was induced with IPTG at a final concentration of 1 .0 mM. After incubation for 3 hours, bacteria were harvested 
by centrifugation at 8000g for 1 5 minutes at 4°C. When necessary cells were stored at -20°C. All subsequent procedures 
were performed on ice or at 4°C. For cytosolic proteins (121 .1 , 128.1 , 593, 726 and 982) and periplasmic protein 920L, 
bacteria were resuspended in 25 ml of PBS containing complete protease inhibitor (Boehringer-Mannheim). Cells were 
lysed by by sonication using a Branson Sonifier 450. Disrupted cells were centrifuged at 8000g for 30 min to sediment 
unbroken cells and inclusion bodies and the supernatant taken to 35% v/v saturation by the addition of 3.9 M (NH 4 ) 2 S0 4 . 
The precipitate was sedimented at 8000g for 30 minutes. The supernatant was taken to 70% v/v saturation by the 
addition of 3.9 M (NH 4 ) 2 S0 4 and the precipitate collected as above. Pellets containing the protein of interest were 
identified by SDS-PAGE and dialysed against the appropriate ion-exchange buffer (see below) for 6 hours or overnight. 
The periplasmic fraction from E.coli expressing 953L was prepared according to the protocol of Evans et. al. [Infect. Immun. 
(1974), 10:1010-1017] and dialysed against the appropriate ion-exchange buffer. Buffer and ion exchange resin were 
chosen according to the pi of the protein of interest and the recommendations of the FPLC protocol manual (Pharmacia). 
Buffer solutions included 20 mM NaCI, and 10% (v/v) glycerol. The dialysate was centrifuged at 13000g for 20 min and 
applied to either a mono Q or mono S FPLC ion-exchange resin. Buffer and ion exchange resin were chosen according 
to the pi of the protein of interest and the recommendations of the FPLC protocol manual (Pharmacia). Proteins were 
eluted from the ion-exchange resin using either step-wise or continuous NaCI gradients. Purification was analysed by 
SDS-PAGE and protein concentration determined by Bradford method. Cleavage of the leader peptide of periplasmic 
proteins was demonstrated by sequencing the NH 2 -terminus (see below). 

[0284] FPLC-B: These proteins were purified from the membrane fraction of E.coli. Single colonies harbouring the 
plasmid of interest were grown overnight at 37°C in 20 ml of LB/Amp (100 jjug/ml) liquid culture. Bacteria were diluted 
1 :30 in 1.0 L of fresh medium. Clones 406.1 L and 919LOrf4 were grown at 30°C and Orf25L and 576.1 L at 37°C until 
the OD 550 reached 0.6-0.8. In the case of 919LOrf4, growth at 30°C was essential since expression of recombinant 
protein at 37°C resulted in lysis of the cells. Expression of recombinant protein was induced with IPTG at a final con- 
centration of 1 .0 mM. After incubation for 3 hours, bacteria were harvested by centrifugation at 8000g for 15 minutes at 
4°C. When necessary cells were stored at -20 °C. All subsequent procedures were performed at 4°C. Bacteria were 
resuspended in 25 ml of PBS containing complete protease inhibitor (Boehringer-Mannheim) and lysed by osmotic 
shock with 2-3 passages through a French Press. Unbroken cells were removed by centrifugation at 5000gfor 15 min 
and membranes precipitated by centrifugation at 100000g (Beckman Ti50, 38000rpm) for 45 minutes. A Dounce ho- 
mogenizer was used to re-suspend the membrane pellet in 7.5 ml of 20 mM Tris-HCI (pH 8.0), 1 .0 M NaCI and complete 
protease inhibitor. The suspension was mixed for 2-4 hours, centrifuged at 1 0OOOOg for 45 min and the pellet resuspended 
in 7.5 ml of 20mM Tris-HCI (pH 8.0), 1 .0M NaCI, 5.0mg/ml CHAPS, 1 0% (v/v) glycerol and complete protease inhibitor. 
The solution was mixed overnight, centrifuged at 100000g for 45 minutes and the supernatant dialysed for 6 hours 
against an appropriately selected buffer. In the case of Orf25.L, the pellet obtained after CHAPS extraction was found 
to contain the recombinant protein. This fraction, without further purification, was used to immunise mice. 
[0285] FPLC-C: Identical to FPLC-A, but purification was from the soluble fraction obtained after permeabilising E.coli 
with polymyxin B, rather than after cell disruption. 

[0286] FPLC-D: A single colony harbouring the plasmid of interest was grown overnight at 37°C in 20 ml of LB/Amp 
(1 00 fxg/ml) liquid culture. Bacteria were diluted 1 :30 in 1 .0 L of fresh medium and grown at 30°C until the OD 550 reached 
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0.6-0.8. Expression of recombinant protein was induced with IPTG at a final concentration of 1 .OmM. After incubation 
for 3 hours, bacteria were harvested by centrifugation at 8000g for 1 5 minutes at 4°C. When necessary cells were stored 
at -20 °C. All subsequent procedures were performed on ice or at 4°C. Cells were resuspended in 20mM Bicine (pH 
8.5), 20mM NaCI, 1 0% (v/v) glycerol, complete protease inhibitor (Boehringer-Mannheim) and disrupted using a Branson 
Sonifier 450. The sonicate was centrifuged at 8000g for 30 min to sediment unbroken cells and inclusion bodies. The 
recombinant protein was precipitated from solution between 35% v/v and 70% v/v saturation by the addition of 3.9M 
(NH 4 ) 2 S0 4 . The precipitate was sedimented at 8000g for 30 minutes, resuspended in 20 mM Bicine (pH 8.5), 20 mM 
NaCI, 10% (v/v) glycerol and dialysed against this buffer for 6 hours or overnight. The dialysate was centrifuged at 
13000g for 20 min and applied to the FPLC resin. The protein was eluted from the column using a step-wise NaCI 
gradients. Purification was analysed by SDS-PAGE and protein concentration determined by Bradford method. 

Cloning strategy and oligonucleotide design 

[0287] Genes coding for antigens of interest were amplified by PCR, using oligonucleotides designed on the basis of 
the genomic sequence of N. meningitidis B MC58. Genomic DNA from strain 2996 was always used as a template in 
PCR reactions, unless otherwise specified, and the amplified fragments were cloned in the expression vector pET21 
D+ (Novagen) to express the protein as C-terminal His-tagged product, or in pET-24b+(Novagen) to express the protein 
in 'untagged' form (e.g. AG 287K). 

[0288] Where a protein was expressed without a fusion partner and with its own leader peptide (if present), amplification 
of the open reading frame (ATG to STOP codons) was performed. 

[0289] Where a protein was expressed in 'untagged' form, the leader peptide was omitted by designing the 5' -end 
amplification primer downstream from the predicted leader sequence. 

[0290] The melting temperature of the primers used in PCR depended on the number and type of hybridising nucleotides 
in the whole primer, and was determined using the formulae: 



T m i = 4(G+C)+2(A+T) (tail excluded) 



T m2 = 64.9 + 0.41 (% GC) - 600/N (whole primer) 

[0291 ] The melting temperatures of the selected oligonucleotides were usually 65-70°C for the whole oligo and 50-60°C 
for the hybridising region alone. 

[0292] Oligonucleotides were synthesised using a Perkin Elmer 394 DNA/RNA Synthesizer, eluted from the columns 
in 2.0ml NH 4 OH, and deprotected by 5 hours incubation at 56°C. The oligos were precipitated by addition of 0.3M Na- 
Acetate and 2 volumes ethanol. The samples were centrifuged and the pellets resuspended in water. 
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Sequences 


Restriction 
site 








CGCdCi ATPPPiPTAPP-A A A AC A APPPiAPA A APPG 

V/VJ\_/VJVJI AV 1 V_^V_^VJV^ 1 AvJ \^~/A /A/A/AV^/A/A /A V^/A/A/A V^- VJ 


IN IIC1 


5 




rvcv 


rrrnrTrnAri TTArrAnrrTr,TAr,rrTA 

vLLuv^ I LUAU- l 1 /aH^/aVjUjvj 1 /\UV/L/ 1 /A 


Anoi 




urn 


Fvvd 


fTAfiPTAnr PP AP AP APTTATTTPPPP ATP 


iNnei 






Rev 


OCCOC I LuAO- 1 1 AttAULljO 1 Autt 1 AA 1 1 J O 


Xhol 


10 


f\~ri i n«i« A 

u r 1 1 iaj m p A 


Fwd 




iNaei-v, JNnei j 




Rev 


CCCGo 1 LuAO- 


Xhol 






Fwd 


rr.rr,r. a Trrr a t a Tr. a a a a rrTTrTTP a a a a c^c^ 
LuLuuA 1 l-LLA 1 A 1 U-AAAAtt 1 1L1 1 tAAAALL 


INaei 






Rev 


LttOt 1 LuAvj- 1 1 A 1 1 I Out 1 OOOCC 1 1 O 


Anoi 




Ori7-l L 


rwa 


nC'CO.r* A TT A AT A TDTTP A H A A A A T'~VClT"m AAA Tflfl 

OOOOOA 1 I AA 1 -A lull UAuAAAA 1 lu 1 1 uAAA 1 OO 


Asel 


15 




Rev 


f^Cf^f^f^f^TTT^ A C* TT A TTTTTTr A A A A T A T A TTTnr 

OOOOOO 1 OOAO- 1 1 A 1 I 1 1 1 1 LAAAA 1 A 1 A 1 1 1 Ut 


Ail 01 




Dr\y- 1 L 


rwa 


PPPPPP AT ATP TT A PPT A APPP.TTTP A A A A TPT 
uLuuULA 1 A 1 O- 1 1 AtC 1 AAtLu 111 LAAAA 1 \J 1 


INaei 






Rev 


uLuuUU 1 LuAu- 1 1 A 1 1 1 OOO Auu 1111 Luuu 


Anoi 




Orf23L 


Fwd 


CGCGGATCCCATATG-ACACGCTTCAAATATTC 


Ndel 


20 




Rev 


CCCGCTCGAG-TTATTTAAACCGATAGGTAAA 


Xhol 




Orf25-l His 


Fwd 


CGCGGATCCCATATG-GGCAGGGAAGAACCGC 


Ndel 






Rev 


GCCCAAGCTT-ATCGATGGAATAGCCGCG 


Hindlll 




Orf29-l b-His 


Fwd 


CGCGGATCCGCTAGC-AACGGTTTGGATGCCCG 


Nhel 


25 


(MC58) 


Rev 


CCCGCTCG AG-TTTGTCTAAGTTCCTGATAT 
CCCGCTCG AG- ATTCCCACCTGCC ATC 


Xhol 




/"A r> 1 U T 


rwa 


r^nr^nn a Tr^r^nr^T a nr^ a tci a a TTTr.rrT a ttp a a a a a t 
OOOOO A 1 OOOO 1 Aut-A I OAA I 1 1 OOO 1 A J I tAAAAA 1 


rsnei 






Rev 


cccr^r^TC^r^ a p tt a a TTrrr a pptppp a tp 

LLLuL I LUAu- 1 1 AA 1 1 LtLALL 1 uLLA 1 V_ 


A.noi 


30 


Cirrzy-i c-His 


rwa 


rrirrin ATprr,rTAr,r at/^a atttppptattpa a a a at 

LULUuA 1 LLUL 1 AOO-A 1 OAA 111 ULt 1 A 1 1 tAAAAA 1 


fNnei 




(ML5o) 


Rev 


LLLut 1 LuAu- 1 1 uuALuA 1 UCLCuLuA 


Anoi 




ijrizy-i c-l» 


rwa 


enenn atpppptapp atp a atttppptattpa a a a at 

LutUuA I LLUL 1 AuL-A 1 OAA 1 1 1 1 A 1 I LAAAAA 1 


fsnei 




(MC58) 


Rev 


CCCGCTCG AG-TTATTGGACGATGCCCGC 


Xhol 


35 


Orf25L 


Fwd 


CGCGGATCCCATATG-TATCGCAAACTGATTGC 


Ndel j 




Rev 


CCCGCTCGAG-CTAATCGATGGAATAGCC 


Xhol 




Orf37L 


Fwd 


CGCGGATCCCATATG-AAACAGACAGTCAAATG 


Ndel 






Rev 


CCCGCTCG AG-TCAATAACCCGCCTTCAG 


Xhol 


40 


Orf38L 


Fwd 


CGCGGATCCCATATG- 

TTACGTTTGACTGCTTTAGCCGTATGCACC 


Ndel 






Rev 


CCCGCTCGAG- 

TTATTTToCCGCGTTAAAAoGGTCoGGAAC 


Xhol 






Fwd 


enf^nn atccc* at atp a apa a a atatappppat 

tULuuA I LLLA 1 A 1 u-AALAAAA 1 A 1 AttuLA J 




45 




ivev 


rT i r , nr"TCa ap ttatp ai^tp ata Arrr; at 

tLLUL 1 LUAU- 1 1 ALLAL 1 OA 1 AALtuAt 


Anoi 




VJrl4U.X-rllS 


rwa 


rricnn a tppp ata tp a ppp a tp a pp a pp a ttt a t 

OOOOOA 1 OOOA 1 A 1 Vj-Av^vjr /\ 1 UAl/UAtUA 1 I 1 A 1 


iNaei 






D OA/ 


C\CCC* A A P.PTT PP A PTP A T A A PPP A P A P A 
OOOOAAOOl 1 -LtAL 1 OA 1 AAttuAtAu A 


ninaiii 




Ort4U.2L< 


Fwd 


cn c^r* n a Tc^r^r* ata tc a a a a atata c*c*r y a t 
OOOOOA 1 LLLA 1 A 1 O-AAOAAAA 1 A 1 AttuLA 1 


INaei 


50 




Rev 


GCCC A AGPTT-TTAPP APTG AT A APPPi AP 


Hindlll 

1111 IU 111 




Orf46-2L 


Fwd 


GGGAATTCCATATG-GGCATTTCCCGCAAAATATC 


Ndel 






Rev 


CCCGCTCGAG-TTATTTACTCCTATAACGAGGTCTCTTAAC 


Xhol 




Orf46-2 


Fwd 


GGGAATTCCATATG-TCAGATTTGGCAAACGATTCTT 


Ndel 


55 




Rev 


CCCGCTCGAG-TTATTTACTCCTATAACGAGGTCTCTTAAC 


Xhol 


Orf46.1L 


Fwd 


GGGAATTCCATATG-GGCATTTCCCGCAAAATATC 


Ndel 



79 
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Rev 


CCCGCTCGAG-TTACGTATCATATTTCACGTGC 


Xhol 




orf46. (His-GST) 


Fwd 


GGGAATTCCATATGCACGTGAAATATGATACGAAG 


BamHI-Ndel 


5 




Rev 


CCCGCTCGAGTTTACTCCTATAACGAGGTCTCTTAAC 


Xhol 




orf46.1-His 


Fwd 


GGGAATTCCATATGTCAGATTTGGCAAACGATTCTT 


Ndel 






Rev 


CCCGCTCGAGCGTATCATATTTCACGTGC 


Xhol 




orf46.2-His 


Fwd 


GGGAATTCCATATGTCAGATTTGGCAAACGATTCTT 


Ndel 


10 




Rev 


CCCGCTCGAGTTTACTCCTATAACGAGGTCTCTTAAC 


Xhol 




Orf65-l-(His/GST) 


Fwd 


CGCGGATCCCATATG-CAAAATGCGTTCAAAATCCC 


BamHI-Ndel 




(MC58) 


Rev 


CGCGG ATCCCATATG-AACAAAATATACCGCAT 
CCCGCTCGAG -TTTGCTTTCGATAGAACGG 


Xhol 


15 


Ori72-lL 


rwa 


r^iT^r* f~*(~* ATA Tf^ {^"TC* ATA A A A T A T A A A A TTTP. A A 

uLuuLLA I A 1 O-O 1 LA 1 AAAA 1 A 1 ALAAA 111 OAA 


XTz1*»1 






Rev 


OCOOCC 1 tuAU- 1 1 AutL I O AO AtC 111 UtAAA 1 I 


AnOi 




Orf/o-lL. 


Fwd 


r^rr^r^r^r^ a t a tp aaa^a^aaaaaaa 
uLuuLLA 1 A 1 U-AAALAUAAAAAAALLUL 1 O 


rsaei 






Rev 


GCGGCCTCGAG-TTACGGTTTGACACCGTTTTC 


Xhol 


20 


Orf83.1L 


Fwd 


CGCGG ATCCCATA 1 U-AAAACLL I uL 1 CC 1 C 


Ndel 




Rev 


CCCGCTCGAG-TTATCCTCCTTTGCGGC 


Xhol 




Orf85-2L 


Fwd 


f-yr-^ /-> /—>/-» A TP A TTZ - • Z - ' /"> A A A A A TO a Tf~< A A a T/™ 1 Z"' / — ' 

GCGGCCATATG-GCAAAAATGATGAAAToOO 


Ndel 






Rev 


GCGGCCTCGAG-TTATCGGCGCGGCGGGCC 


Xhol 


25 


Orf91L (MC58) 


Fwd 
Rev 


GCGGCCATATGAAAAAATCCTCCCTCATCA 
GCGGCCTCGAGTTATTTGCCGCCGTTTTTGGC 


Ndel 
Xhol 




Orf91-His(MC58) 


Fwd 
Rev 


GCGGCCATATGGCCCCTGCCGACGCGGTAAG 
GCGGCCTCGAGTTTGCCGCCGTTTTTGGCTTTC 


Ndel 
Xhol 


30 


Orf97-lL 


Fwd 
Rev 


GCGGCCATATG-AAACACATACTCCCCCTGA 
GCGGCCTCGAG-TTATTCGCCTACGGTTTTTTG 


Ndel 
Xhol 




Orfll9L(MC58) 


Fwd 
Rev 


GCGGCCATATGATTTACATCGTACTGTTTC 
GCGGCCTCGAGTTAGGAGAACAGGCGCAATGC 


Ndel 
Xhol 


35 


Orfll9-His(MC58) 


Fwd 
Rev 


GCGGCCATATGTACAACATGTATCAGGAAAAC 
GCGGCCTCGAGGGAGAACAGGCGCAATGCGG 


Ndel 
Xhol 




Orfl37.1 (His- 
GST) (MC58) 


Fwd 


CGCGG ATCCGCTAGCTGCGGCACGGCGGG 


BamHI-Nhel 






Rec 


CCCGCTCGAGATAACGGTATGCCGCCAG 


Xhol 


40 


Orfl43-lL 


Fwd 
Rev 


/"l/" 1 t*> A T , Z"1 Z^ Z~^ A T 1 A T/""* A A TPZ'*' A A Z~< A Z*"V I " 1 " 1 A Z - * 1 

CGCGGATCCC ATATG-G AATCA AC AC Y I I CAC 
CCCGCTCGAG-TTACACGCGGTTGCTGT 


Ndel 
Xhol 




008 


Fwd 


f^/-* i~"if~* /~~* A T"'/'" 1 Z"^ /~< A TP A T 1 /^ A A A A z~i A /""• A Z^ A TTTTr 1 

CGCGOA 1 CCCA 1 A I O-AACAACAOACA 1 1 1 lO 


Ndel 






Rev 


CCCGC T CGAG- T TACCTGTCCGGTAAAAG 


Xnol 


45 


050-1(48) 


rwa 


CCjCCjCjA 1 CCCrC 1 AUC-ACCCj 1 CA 1 LAAALAuUAA 


Nnei 




Rev 


CCCGCTCGAG-TCAAGATTCGACGGGGA 


Xhol 




105 


rwa 


CCjCOCjA 1 CCCA 1 A 1 0- 1 LtuLAAALuAA 1 ALU 


Naei 






Rev 


LL-HjL 1 LuAu- I LAO lOt 1LI ULLAU 1 1 1 


Anoi 


50 


iiil 

1 1 1 JU 


Fwd 




Ndel 




Rev 


CCCGCTCGAG-TTAGCGGAGCAGTTTTTC 


Xhol 




117-1 


Fwd 


CGCGG ATCCCATATG-ACCGCCATCAGCC 


Ndel 






Rev 


CCCGCTCGAG-TTAAAGCCGGGTAACGC 


Xhol 


55 


121-1 


Fwd 


GCGGCCATATG-GAAACACAGCTTTACATCGG 


Ndel 




Rev 


GCGGCCTCGAG-TCAATAATAATATCCCGCG 


Xhol 



80 
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122-1 


Fwd 


GCGGCCATATG-ATTAAAATCCGCAATATCC 


Ndel 






Daw 

ivcv 


dCCidCCTCCi Afi-TTA A ATPTTGGTAG ATTGG ATTTGO 

VJVw' vJ VJ V^ V_- 1 V»xVJ/AvJ- 1 1 /A ./A /A 1 1 1 VJ VJ f r\\J /A I 1 VJ VJ rv 111 VJ VJ 


Xhol 


5 


11Ji 1 


Fwd 


gpggppatatg-aptgapaapgpaptgptpp 

VJVw>VJ VJV^V_^/A 1 /A 1 VJ rtV/ 1 VJ/A V^/A/AV^VJ V^/A \s I VJV^ 1 V^Vj 


Ndel 






Kcv 


vjv^vj vJv^\-^ 1 v^Vj/VO- 1 v^/\vj/\v^v_,vjv^vj I 1U 1 v>vJ/\/\/\v^ 








Fwd 


cncaa ATrrrATATn ppptta a a a ap atp a a a 

v_/VJV_/VJ\J/-\ 1 1 /\ 1 VJ-VjV^vj 1 1 /\/A/A/A/\V_r/A 1 v>/A/\/ \ 


1NUC1 






Rev 


CCCGCTCGAG-TCAGCCCTTCATACAGC 


Xhol 


10 


149.1L(MC58) 


Fwd 


GCGGCATTAATGGCACAAACTACACTCAAACC 


Asel 




Rev 


GCGGCCTCGAGTTAAAACTTCACGTTCACGCCG 


Xhol 




14SU-His(MC58) 


Fwd 


GCGGCATTAATGCATGAAACTGAGCAATCGGTGG 


Asel 






Rev 


GCGGCCTCGAGAAACTTCACGTTCACGCCGCCGGTAAA 


Xhol 


15 


205 (His-GST) 
(MC58) 


Fwd 


CGCGGATCCCATATGGGCAAATCCGAAAATACG 


BamHl-Ndel 






Rev 


CCCGCTCGAGATAATGGCGGCGGCGG 


Xhol 




206L 


Fwd 


CGCGGATCCCATATG-TTTCCCCCCGACAA 


Ndel 






Rev 


CCCGCTCGAG-TCATTCTGTAAAAAAAGTATG 


Xhol 


20 


214 (His-GST) 
(MC58) 


Fwd 


CGCGGATCCCATATGCTTCAAAGCGACAGCAG 


BamHI-Ndel 






rvCV 


cccnc'TT'n a pttppp a tttttgppt a ptp 

ULLUL 1 v^vJ/Vvj 1 I l_-vjvj/\ 11111 VJv_-vJ 1 AL- 1 V_- 


YhnT 

.A.1IU1 








PGPGGATPPP AT ATG-GP A ATGGP AG A AAAPG 

V- VJ V^ VJVJ /A 1 v^\^\-'rt 1 /A 1 VJ— \J\_,r\r\ 1 VJ VJ V^ VA VJ r\r\r\r\' — VJ 


Ndel 


25 




IvC V 


rrrr.rTrnAr, ptatapa atppptpppp; 

V^V^, V_^VJ V^ 1 V^Vj/AVJ-V^ 1 rt 1 AWAn 1 V_.V_^VJ 1 Vj V_- V_^VJ 


YhnT 
yvi iui 




1 1 

ZZ5~1 Li 


rWQ 


PPPPPl A TPPP ATA TH PI A TTPTTTTTTP AAA PP 

V_J<J V_^ VJVJ/A 1 VwV^V^/\ 1 /A I VJ-VJ/A 1 lv/1 1 1 1 1 1 V^/A/A/AVw- V_ 


INUCl 






Rev 


PPPHPTPPi AH-TPAOTTPAG A A AGPGGG 

V-<V/V/VJv/ 1 VwVJ/AVJ- 1 V^/AVJ 1 1 V_^/A VJ./A/A/A VJ V_^VJ VJ VJ 


Xhol 






Fwd 


CGCGG ATPPP AT ATG- A A APPTTTG A TTTTAGG 

V_/VJ V>VJ VJ/A 1 V^V_/Vw/A 1 /A 1 VJ AnAV/Vy 111 VJ /A 1111 /lVJ VJ 


Ndel 


30 




P. PV 

rvc v 


P P Pn P TP Pi A n -TT A TTTGPGPTGP TP TTP 

V_JwV_/VJV_^ 1 w VJ r\ VJ 1 1 /A 111 VJ VJVJV^ 1 VJv-' 1 V-/ 1 1 


Xhol 




r wu 


CCiCna A TPPP A T A TPI-OT A A TPPiTPTPifiTTPi 

V^/Vj V^ VJVJ/ \ 1 VvVvL/A 1 A 1 VJ _ VJ 1 /A/A 1 V^VJ 1 V/l VJVJ 1 1 VJ 


INUCl 






Rev 


CCCGCTCGAG-CTACGACTTGGTTACCG 


Xhol 




247-1 L 


Fwd 


GCGGCCATATG-AGACGTAAAATGCTAAAGCTAC 


Ndel 


35 




Rev 


GCGGCCTCGAG-TCAAAGTGTTCTGTTTGCGC 


Xhol 


264-His 


Fwd 


GCCGCCATATG-TTGACTTTAACCCGAAAAA 


Ndel 






Rev 


GCCGCCTCGAG-GCCGGCGGTCAATACCGCCCGAA 


Xhol 




270 (His-GST) 
(JV1C58) 


Fwd 


CGCGG ATCCCATATGGCGCAATGCGATTTGAC 


BamHI-Ndel 


40 




Daw 

IVCV 


pppnpTpp AnTTpnnppTOTA a atpppci 

V-/VyV/VJV^ 1 V^VJ/AVJ 1 1 V^VJ VJ V_/VJVJ 1 /A/A/A 1 VJ V_- Vw/ VJ 


Xhol 




274L 


Fwd 


GCGGCCATATG-GCGGGGCCGATTTTTGT 


Ndel 






Rev 


GCGGCCTCGAG-TTATTTGCTTTCAGTA TTATTG 


Xhol 




283L 


Fwd 


GCGGCCATATG-AACTTTGCTTTATCCGTCA 


Ndel 


45 




Rev 


GCGGCCTCGAG-TTAACGGCAGTATTTGTTTAC 


Xhol 




285-His 


Fwd 


CGCGGATCCCATATGGGTTTGCGCTTCGGGC 


BamHI 






Rev 


GCCCAAGCTTTTTTCCTTTGCCGTTTCCG 


Hindlll 




286-His 


Fwd 


CGCGG ATCCCATATG-GCCGACCTTTCCGAAAA 


Ndel 


50 


(MC58) 


Rev 


CCCGCTCGAG-GAAGCGCGTTCCCAAGC 


Xhol 




286L 


Fwd 


CGCGG ATCCCATATG-CACGACACCCGTAC 


Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTAGAAGCGCGTTCCCAA 


Xhol 




287L 


Fwd 


CTAGCTAGC-TTTAAACGCAGCGTAATCGCAATGG 


Nhel 


55 




Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGCC 


Xhol 



81 
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287 


Fwd 


CTAGCTAGC-GGGGGCGGCGGTGGCG 


Nhel 


Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGCC 


Xhol 


287LOrf4 


Fwd 


CTAGCTAGCGCTCATCCTCGCCGCC- 
TGCGGGGGCGGCGGT 


Nhel 


1? A\r 
IVCV 


C*C y (~ y r**r^T'C^(~* A O TO A A TOOTO PTP I "I'T'I'M'TO/-*/^ 

LLLOL I LuAU- 1 LAA I CC I uL 1 CI 1 1 1 I I OCC 


Xhol 


Zo /-III 


rwa 


LUUOuA I CC-GGGGGCGGCGG I GGCG 


BamHI 


Rev 


CCCGC I CGAG-TCAATCCTGCTCTTTTTTGCC 


Xhol 


287-rilS 


rwa 


C 1 AGC 1 AOC-GGGGGCGGCGGTGGCG 


Nhel 


Rev 


CCCGCTCGAG-ATCCTGCTCTT T T TTGCC * 


Xhol 




XT' - , , J 

rwa 


C I AGC 1 AGC- T GCGGGGGCGGCGGTGGCG 


Nhel 


Rev 


CCCGCTCGAG-ATCCTGCTCTTTTTTGCC 


Xhol 


Al 287-His 


rwa 


OO OOO A t*/~ , oo ot A o z -1 t~^/~* a nr/"* tt a a a r T , z"(z"> />n S 

COCOOA I CCGC 1 AGC-CCCG ATGTTAA ATCGfiC 3 


Nhel 


A2 287-His 


rwa 


CGCGGATCCGCTAGC-CAAGATATGGCGGCAGT 9 


Nhel 


A3 287-His 


rwa 


CGCGGATCCGCTAGC-GCCGAATCCGCAAATCA 9 


Nhel 


A .J WW* 

A4 287-His 


Fwd 


CGCGCTAGC-GGAAGGGTTGATTTGGCTAATGG 9 


Nhel 


A4 287MC58-His 


Fwd 


CGCGCTAGC-GGAAGGGTTGATTTGGCTAATGO 9 


Nhel 


287a-His 


Fwd 


CGCCATATG-TTTAAACGCAGCGTAATCGC 


Ndel 


Rev 


CCCGC 1 CGAG-AAAATTGCTACCGCCATTCGC AGG 


Xhol 


287b-His 


Fwd 


CGCCATATG-GGAAGGGTTGATTTGGCTAATGG 


Ndel 


287b-2996-His 


Rev 


CCCGCTCGAG-CTTGTCTTTATAAATGATGAGATATTTG 


Xhol 


237 D-IVlC5»-HlS 


Rev 


C(~^f~^t ' Z' I'l"/ " A ' A /" '1 " 1" 1 ' ATA A A A /~" A T A A T" A TP A TPTP/^ A A TTy^/^i 

CCCOC 1 CO AO- 1 I I A rAAAAGATAATATATTG ATTG ATTC.C. 


Xhol 


287e-2996-His 


Fwd 


CGCGCTAGC-ATGCCGCTGATTCCCGTCAATC § 


Nhel 




rwa 


C 1 AOC 1 AOC-GGGGGCGGCGGTGGCG 


Nhel 


Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGGG 


Xhol 


AG287-His * 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 


Rev 


CCCGCTCGAG-ATCCTGCTCTTTTTTGCC 


Xhol 


AG287K(2996) 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 


Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGCC 


Xhol 


AG 287-L 


Fwd 


CGCGGATCCGCTAGC- 

1 1 1 0/V/\l^OL^Avj 1 \J I KjPi l 1 ULAA 1 uut 1101A111J1 OCC 

CTTTCAGCCTGT TCGCCCGATGTTAAATCGGCG 


Nhel 


Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGCC 


Xhol 


AG 287-Orf4L 


Fwd 


CGCGGATCCGCTAGC- 

AAAACCTTCTTCAAAACCCTTTCCGCCGCCGCACTCGCG 
CTCATCCTCGCCGCCTGC TCGCCCGATGTTAAATCG 


Nhel 


Pa*/ 
KcV 


LtLut 1 LuAu- J CAA 1 CC I OC I C 1 1 1 I I I OCC 


Xhol 




rwa 


COCOOA I CCCATATG-AAAACCAAGTTAATC A A A 


Ndel 


Rev 


CCCGCTCG AG-TTATTG ATTTTTGCGG ATG A 


Xhol 


JUo-I 


rwa 


COCOOA 1 CCCATATG-TTAAATCGGGTATTTTATC 


Ndel 


Rev 


CCCOC1COAO-I I AATCCGCC ATTCCCTG 


Xhol 


401L 


Fwd 


GCGGCCATATG-AAATTACAACAATTGGCTG 


Ndel 




Rev 


GCGGCCTCGAG-TTACCTTACGTTTTTCAAAG 


Xhol 


406L 


Fwd 


CGCGGATCCCATATG-CAAGCACGGCTGCT 


Ndel 




Rev 


CCCGCTCGAG-TCAAGGTTGTCCTTGTCTA 


Xhol 


502-1 L 


Fwd 


CGCGGATCCCATATG-ATGAAACCGCACAAC 


Ndel 




Rev 


CCCGCTCGAG-TCAGTTGCTCAACACGTC 


Xhol 



82 
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CA-l A /-|j: c rcT\ 
MP^-A ^MlS-Oo I ) 


Fwd 




RntriHl MH^I 
Dainrii-iNaci 






ivev 




.A.I1U1 


5 




rwa 


rr.rr.r, a Trrr a t a Tr, r.r a rr,nTrr,TT a t a r 


iNaei 






Rev 


rrrr.PTrr, a r. rTArrr,rr,rATTrrTr, 

LLLUL 1 L/VjAkj-v- 1 ALLuLuLA 1 1 K^\^ 1 O 


Anoi 




CIO 11 


Fwd 


r.pr.nrr a t a tp. r. a a tttttp a tt a tptthtt 

ULUuLLA 1 J\ 1 VJ-VJA/v 11111 CA 1 1 AIL 1 IvJl 1 


iNaei 






tvev 


r.rr.nrrTrr, a p ttatttppppp.ttttp.ptp.p 


yviiOl 


10 




rwa 


PPP.PPPATATr. A A P.T A TPTPPP.PTT A TTTTTP 


iNaei 






P,PP,P,PPTPPiAPi TTATPnOPTTHTriPA APP,P, 


.A.I101 






Fwd 


V^vJ V^VJVJ/\ 1 V_. V^VJ V-/ 1 /A vj V^ 1 V^Vw-VJ \J V^/A VJ V^/A/A/A/AV^ VJ/A 


Ram HI-TVJhel 

Dcllll n.1 






Raw 

ivev 


f.rpfA A PiPTT. A PPiP A PiTTPPtH A A TftP APi 
U^V-vAnUL/ I 1 -/\V^vJv^/YvJ 1 1 v^V-JVJ/A/A 1 VJIJ/WJ 


n iiiuiii 


15 


CM! 


Fwd 


nrCCtCC AT ATHTTfi A AT ATT A A APTPiA A A APPTTH 

UUL/VJV/V^A 1 A 1 \J 1 1 \Jr\r\ 1A1 1 AAAv 1 VJ/\rvr\/\v^V^ 1 1 U 






Rev 


GCCGCCTCGAGTTATTTCTGATGCCTTTTCCC 


Xhol 




556L 


Fwd 


GCCGCCATATGGACAATAAGACCAAACTG 


Ndel 






Rev 


GCCGCCTCGAGTTAACGGTGCGGACGTTTC 


Xhol 


20 


557L 


Fwd 


CGCGGATCCCATATG-AACAAACTGTTTCTTAC 


Ndel 




Rev 


CCCGCTCGAG-TCATTCCGCCTTCAGAAA 


Xhol 




564ab-(His/GST) 
(MC58) 


Fwd 


CGCGGATCCCATATG- 

CAAGGTATCGTTGCCGACAAATCCGCACCT 


BamHI-Ndel 


25 


Rev 


CCCGCTCGAG- 

AGCTAATTGTGCTTGGTTTGCAGATAGGAGTT 


Xhol 




564abL (MC58) 


Fwd 


CGCGGATCCCATATG- 

AACCGCACCCTGTACAAAGTTGTATTTAACAAACATC 


Ndel 






Rev 


CCCGCTCGAG- 

TTAAGCTAATTGTGCTTGGTTTGCAGATAGGAGTT 


Xhol 


30 


564b- 
(His/GST)(MC58) 


Fwd 


CGCGGATCCCATATG- 

ACGGGAGAAAATCATGCGGTTTCACTTCATG 


BamHI-Ndel 






Rev 


CCCGCTCGAG- 

AGCTAATTGTGCTTGGTTTGCAGATAGGAGTT 


Xhol 


35 


564c- 
(His/GST)(MC58) 


Fwd 


CGCGGATCCCATATG- 

GTTTCAGACGGCCTATACAACCAACATGGTGAAATT 


BamHI-Ndel 






Rev 


CCCGCTCGAG- 

GCGGTAACTGCCGCTTGCACTGAATCCGTAA 


Xhol 




564bc- 
(His/GST)(MC58) 


Fwd 


CGCGGATCCCATATG- 

ACGGGAGAAAATCATGCGGTTTCACTTCATG 


BamHI-Ndel 


40 




Rev 


CCCGCTCGAG- 

GCGGTAACTGCCGCTTGCACTGAATCCGTAA 


Xhol 




564d- 
(His/GST)(MC58) 


Fwd 


CGCGGATCCCATATG- 

CAAAGCAAAGTCAAAGCAGACCATGCCTCCGTAA 


BamHI-Ndel 


45 




Rev 


CCCGCTCGAG- 

TCTTTTCCTTTCAATTATAACTTTAGTAGGTTCAATTTTG 
GTCCCC 


Xhol 




30'ica- 
(His/GST)(MC58) 


rwa 


nncciCi a Trrr a t a tci 

U\J\^UvJ/\ 1 V/V^^A 1 /V 1 VJ- 

GTTTCAGACGGCCTATACAACCAACATGGTGAAATT 


Dd.inni-i>uci 


50 




Rev 


CCCGCTCGAG- 

TCTTTTCCTTTCAATTATAACTTTAGTAGGTTCAATTTTG 
GTCCCC 


Xhol 




570L 


Fwd 


GCGGCCATATG-ACCCGTTTGACCCGCG 


Ndel 






Rev 


GCGGCCTCGAG-TCAGCGGGCGTTCATTTCTT 


Xhol 




576-1L 


Fwd 


CGCGGATCCCATATG-AACACCATTTTCAAAATC 


Ndel 


55 




Rev 


CCCGCTCGAG-TTAATTTACTTTTTTGATGTCG 


Xhol 



83 



EP 1 790 660 A2 







Fwd 


nrnr*rc at atPi fi a TTrnrrr a a nfiTrrifi 


1NUCI 






Rev 


GCGGCCTCGAG-CTACACTTCCCCCGAAGTGG 


Xhol 


5 


583L 


Fwd 


CGCGGATCCCATATG-ATAGTTGACCAAAGCC 


Ndel 






Rev 


CCCGCTCGAG-TTATTTTTCCGATTTTTCGG 


Xhol 




593 


Fwd 


GCGGCCATATG-CTTGAACTGAACGGACT 


Ndel 






Rev 


GCGGCCTCGAG-TCAGCGGAAGCGGACGATT 


Xhol 


10 


650 (His-GST) 
(MC58) 


Fwd 


CGCGGATCCCATATGTCCAAACTCAAAACCATCG 


BamHI-Ndel 






Rev 


pppnr'Tpr! a nnrTTrr a a tp a r.TTTr, a pp 
LLLUL 1 OOAOOO I 1 CL AA J LAU 1 1 1UALL 


Anol 




652 


rWd 


a t a tp appppa a tppttp. a t a ttttp 
uLuuLLA 1 A 1 O-AOOOOAA I Lu 1 1 OA I A J 1 1 1L 


Ndel 






Rev 


f>/-^ p»^tpp a /"> tt a ' i " i " i *p pp p a p ttp p t a p a a TP" 1 
GCGGCC 1 OOAO- 1 I A 1 1 1 OLCLAO 1 1 uu I AO AA 1 0 


Anol 


15 


664L 


rwd 


/-~« (~>r* r*(-* A TP A TP PTP A T A P A TP pr 1 P A PT A ^TTr 1 

uLuuLLA 1 A 1 O-O 1 OA 1 ALA 1 LLuLAL 1 AO 1 1 O 


Ndel 






Rev 


GCGGCCTCGAG-TCAAAATCGAG t tttacacca 


Xhol 




726 


r,.,J 

rwd 


p pp p pp a tp a tp a pp \ tpt a tttp a a a a a ppp 
OOOOOOA 1 A J O-AOOA 1 O 1 A 1 1 1 OAAAAAOOO 


Ndel 






Rev 


p r^t~^ r V(~ i /~~' a p tp a ST^r^c* a tpttt a ppp tpp \ tt 
OOOOOO 1 OOAO- 1 OAOOOOA I O 1 1 1 AOOO 1 OOA I 1 


Anol 


20 


74l-His(MC58) 


rwd 


OOOOOA I OOOA 1 A J O-AOOAOOOOAOOOOO J O 


Ndel 






Rev 


PPPPPTPP AP TTPPTTPPPPPP A APPP 

OOOOO 1 OOAO- I 1 OO 1 lOOOOOOAAOOO 


Anol 




AG741-His(MC58) 


rwd 


f* 1 a Tprr a t a tp p Trr (~*r^ c*(~*r\ apa tpp 
OOOOOA 1 OOOA 1 A I O-O 1 OOOOOOOOAOA 1 OO 


Ndel 






Rev 


c^r^r^r^ ptpp a p ttppttpppppp a a ppp 
OOOOO 1 OOAO- I 1 OO 1 1 OOOOOOAAOOO 


Anol 


25 


686-2-( H ls/G ST) 


Fwd 


/"< /^-> /"</"• A TPPP ATA TP P P PP P ' 1 " 1 'PP P A A P P"* P'P^ 

OOOOOA 1 OOOA 1 A 1 0-OOOOO i 1 OOOAAOOOO 


BamHI-Ndel 




SUM 0\ 

(JVIC58) 


Rev 


p-'P'P-p^P'Tpp a p ttp a a p a ptp a tptpttttppp a 
OOOOO I OOAO- 1 1 OAAOAO 1 OA 1 O 1 O 1 1 1 1 OOOA 


Anol 




719-(His/GST) 


rwd 


PPPPP A TPPPPT A PP A A A PTP TPP TTP P TP TT A A P 

OOOOOA 1 OOOO 1 AOO-AAAO 1 O 1 OO 1 1 OO I O 1 1 AAO 


DnmUl \ I U , I 

tsamHi-Nnei 




(MC58) 


Rev 


p^p'pp- 1 ptpp a p tp' i 'P* a rrrrPTrr 1 a ppp 
OOOOO 1 OOAO- 1 1 OAOOOOO I OOAOOO 


Anol 


30 


730-His (MC58) 


Fwd 


PPPPPP ATA TP P P Z" 1 A PTTPPPPP A A P A PPP 

GCCGCCATATGGCGGACTTGGCGC AAO AOOO 


Ndel 






Rev 


GCCGCCTCGAGATCTCCTAAACCTGTTTTAACAATGCCG 


Xhol 




730A-His (MC58) 


Fwd 


OOOOOOA 1 A 1 OOOOO AO 1 1 OOOOOAAOAOOO 


Ndel 






Rev 


GCGGCCTCGAGCTCCATGCTGTTGCCCCAGC 


Xhol 


35 


73UB-H1S (MC58) 


rwd 


PPPPPPATA TP r^fT^f^ APTTPPPPPA AP A PPP 

OOOOOOA 1 A 1 OOOOO AO I 1 OOOOOAAOAOOO 


Ndel 




Rev 


GCGGCCTCGAGAAAATCCCCGCTAACCGCAG 


Anol 




741 -His 


Fwd 


PP PP P A TPPP ATA TP A P P A P PP PAP P P P PTP 

OOOOOA 1 OOOA 1 A 1 O-AOOAOOOOAOOOOO 1 O 


Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 


40 


AG741-His 


Fwd 


CGCGGATCCCATATG-GTCGCCGCCGACATCG 


Ndel 


(MC58) 


Rev 


CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 




743 (His-GST) 


Fwd 


CGCGGATCCCATATGGACGGTGTTGTGCCTGTT 


BamHI-Ndel 






Rev 


CCCGCTCGAGCTTACGGATCAAATTGACG 


Xhol 


45 


757 (His-GST) 
(MC58) 


Fwd 


CGCGGATCCCATATGGGCAGCCAATCTGAAGAA 


BamHI-Ndel 






IxCV 


rrpnrTpn AnrTP AriPTTTTrTppriTPA a 

V^V^l^VJLv 1 V^VJAVUV^ 1 ^//\VJl_* 1111 \J\_^\J 1 V^r\/\ 


Xhnl 
yviiui 




/ 3y-ri IS/03 1 


r wa 


PPPnfl A TPPPPT A PP TAPTP A TPP ATTPTPPPP 


Dull lnl" !M 11C1 




(MC58^ 


Rev 


CCCGCTCGAG-CCAGTTGTAGCCTATTTTG 


Xhol 


50 


759L 


Fwd 


CGCGGATCCGCTAGC-ATGCGCTTCACACACAC 


Nhel 




(MC58) 


Rev 


CCCGCTCGAG-TTACCAGTTGTAGCCTATTT 


Xhol 




760-His 


Fwd 


GCCGCCATATGGCACAAACGGAAGGTTTGGAA 


Ndel 






Rev 


GCCGCCTCGAGAAAACTGTAACGCAGGTTTGCCGTC 


Xhol 


55 


769-His (MC58) 


Fwd 


GCGGCCATATGGAAGAAACACCGCGCGAACCG 


Ndel 



84 



EP 1 790 660 A2 







Rev 


GCGGCCTCGAGGAACGTTTTATTAAACTCGAC 


Xhol 




907L 


Fwd 


GCGGCCATATG-AGAAAACCGACCGATACCCTA 


Ndel 






Rev 


GCGGCCTCGAG-TCAACGCCACTGCCAGCGGTTG 


Xhol 




9I1L 


Fwd 


CGCGGATCCCATATG-AAGAAGAACATATTGGAATTTTGGGTCGGACTG 


Ndel 






Rev 


CCCGCTCGAG-TTATTCGGCGGCTTTTTCCGCATTGCCG 


Xhol 


10 


911LOmpA 


Fwd 


GGGAATTCCATATGAAAAAGACAGCTATCGCGATTGCA 
GTGGCACTGGCTGGTTTCGCTACCGTAGCGCAGGCCGC 
TAGC-GCTTTCCGCGTGGCCGGCGGTGC 


Ndel-(NheI) 






Rev 


CCCGCTCGAG-TTATTCGGCGGCTTTTTCCGCATTGCCG 


Xhol 




911LPelB 


Fwd 


CATGCCATGG-CTTTCCGCGTGGCCGGCGGTGC 


Ncol 






Rev 


CCCGCTCGAG-TTATTCGGCGGCTTTTTCCGCATTGCCG 


Xhol 


15 


91 3- His/GST 


Fwd 


CGCGGATCCCATATG-TTTGCCGAAACCCGCC 


BamHI-Ndel 




(MC58) 


Rev 


CCCGCTCGAG-AGGTTGTGTTCCAGGTTG 


Xhol 




913L 


Fwd 


CGCGGATCCCATATG-AAAAAAACCGCCTATG 


Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTAAGGTTGTGTTCCAGG 


Xhol 


20 


919L 


Fwd 


CGCGGATCCCATATG-AAAAAATACCTATTCCGC 


Ndel 






Rev 


CCCGCTCGAG-TTACGGGCGGTATTCGG 


Xhol 




919 


Fwd 


CGCGGATCCCATATG-CAAAGCAAGAGCATCCAAA 


Ndel 






Rev 


CCCGCTCGAG-TTACGGGCGGTATTCGG 


Xhol 


25 


91 9L Orf4 


Fwd 


GGGAATTCCATATGAAAACCTTCTTCAAAACCCTTTCCG 

CCGCCGCGCTAGCGCTCATCCTCGCCGCC- 

TGCCAAAGCAAGAGCATC 


NdeI-(NheI) 






Rev 


CCCGCTCGAG-TTACGGGCGGTATTCGGGCTTCATACCG 


Xhol 




(91 9>287 fusion 


Fwd 


CGCGGATCCGTCGAC-TGTGGGGGCGGCGGTGGC 


Sail 


30 




Rev 


CCCGCTCGAG-TCAATCCTGCTCTTTTTTGCC 


Xhol 




920-1 L 


Fwd 


GCGGCCATATG-AAGAAAACATTGACACTGC 


Ndel 






Rev 


GCGGCCTCGAG-TTAATGGTGCGAATGACCGAT 


Xhol 


35 


925-H is/GST 
(MC58) <!ATE 


Fwd 


ggggacaagtttgtacaaaaaagcaggctTGCGGCAAGGATGCCGG 


attBl 




Rev 


ggggaccactttgtacaagaaagctgggtCTAAAGCAACAATGCCGG 


cmB2 




926L 


Fwd 


CGCGGATCCCATATG-AAACACACCGTATCC 


Ndel 






Rev 


CCCGCTCGAG-TTATCTCGTGCGCGCC 


Xhol 




92 7-2-(H is/GST) 


Fwd 


CGCGGATCCCATATG-AGCCCCGCGCCGATT 


BamHI-Ndel 


40 


(MC58) 


Rev 


CCCGCTCGAG-TTTTTGTGCGGTCAGGCG 


Xhol 




932-His/GST 


Fwd 


ggggacaagtttgtacaaaaaagcaggctTGTTCGTTTGGGGGATTTAA 
ACCAAACCAAATC 


attBl 




935 (His-GST) 
(MC58) 


For 


CGCGGATCCCATATGGCGGATGCGCCCGCG 


BamHI-Ndel 




Rev 


CCCGCTCGAGAAACCGCCAATCCGCC 


Xhol 






Rev 


ggggaccacmgtacaagaaagctgggtTCATTTTGTTTTTCCTTCTTCT 
CGAGGCCATT 


attB2 




yjO-I Li 


r WQ 




NHpI 

1NUCI 


50 




Rev 


CCCGCTCGAG-TCAGCGTTGGACGTAGT 


Xhol 




953L 


Fwd 


GGGAATTCCATATG-AAAAAAATCATCTTCGCCG 


Ndel 






Rev 


CCCGCTCGAG-TTATTGTTTGGCTGCCTCGAT 


Xhol 




953-fu 


Fwd 


GGGAATTCCATATG-GCCACCTACAAAGTGGACG 


Ndel 


55 




Rev 


CGGGGATCC-TTGTTTGGCTGCCTCGATTTG 


BamHI 



85 



EP 1 790 660 A2 





954 (His-GST) 
(MC58) 


Fwd 


CGCGGATCCCATATGCAAGAACAATCGCAGAAAG 


BamHI-Ndel 






Rev 


CCCGCTCGAGTTTTTTCGGCAAATTGGCTT 


Xhol 


5 


958-His/GST 
(MC58) CATK 


Fwd 


ggggacaagtttgtacaaaaaagcaggctGCCGATGCCGTTGCGG 


anBl 






Rev 


ggggaccactttgtacaagaaagctgggt I CAOOO 1 tu 1 I I C I 1 GOG 


attB2 




961 L. 


Fwd 


o o o o o a tooo a t a to a a a o a ptttpp a too 
OOOOOA 1 LLLA 1 A 1 0-AAAtAt 1 1 1 CCA 1 CO 


IN del 


10 




Rev 


oy^oo otoo a o tt a oo a otoot a a ttp a o 
OOOOO l CO AO- 1 I AtCAt 1 CO 1 AA i 1 O AO 


AnOl 




961 


T?,. ,A 

rWQ 


r^r^r^r^r^ a tooo a t a to o oo a a a ooo a oo a o 
OOOOOA I OOOA 1 A I O-UCCACAAUCUACOAC 


Ndel 






Rev 


r^{~^ r^r^ r^"~vr^ a o tt a oo a ptpp.t a a ttp. a o 
OOOOO 1 OO AO- 1 i AOO AO 1 OO I AA 1 1 O AO 


Anol 




961 c (His/GST) 


Fwd 


oo o o o a tooo a t a to ooo a o a a a oo a oo 
OOOOO A 1 OOOA 1 A 1 0-OOOAOAAAOOAOO 


BamHI-Ndel 


15 




Rev 


/"'/"'OO /"^TOO A A f~*/~*t~* AOO T'T'/^"!^ A A OOTT^O 

COOOO T OOAO-AOOOAOO 1 J O I A AOO I Tu 


Xnol 




961 c-(His/GST) 


r wd 


r^r^r^r^r^ a tooo a t a to o oo a o a a o oo a r^r** a oo a 
OOOOO A 1 OOOA I A I O-OOOAOAAOOOAOOAOO A 


n nm ui XI A^l 

BamHI-Ndel 




(JV1C58) 


Rev 


r^r^r^iT^T^r^r^ a a r^r^r^ a r~ y r~* r T* r Tf~^T a a o otto 
OOOOO I OOAO-AOOOAOO I lul AAOO 1 1 O 


Xnol 




961 c-L 


Fwd 


^•r^ ooo a tooo a t a to a to a a a o a rTTTrr a too 
OOOOO A 1 OOOA 1 A 1 O-A 1 OAAAOAO 1 1 1 OOA 1 OO 


Ndel 


20 




Rev 


COCO CTOOAO-TTAAOOO AOO 1 I O I AAOO I 


Xnol 




961 c-L 


X?.. ,a 

r wd 


O O O O O A TOOO ATA TO A TO A A A O A OTTTOO A TOO 

OOOOOA 1 OOOA I A 1 O-A 1 OAAAOAO 1 1 1 OOA 1 OO 


XI ^1 ^vT 

Ndel 




(MC58) 


Rev 


COOOCTOOAO-TTAAOOOAOOTI OTAAOO I 


Xhol 




961 d (His/GST) 


. ,A 

r wd 


OOOOO A TOOO ATA TO OOO A O A A A OO A OO 

OOOOOA 1 OOOA 1 A 1 O-OOOAOAAAOOAOO 


BamHI-Ndel 


25 




Rev 


CCCGCTCGAG-GTCTGACACTGTTTTATCC 


Xhol 




961 Al-L 


Fwd 


CGCGGATCCCATATG-ATGAAACACTTTCCATCC 


Ndel 






Rev 


PPPP PTPT 1 A TT A TO OTTTOO OO O O A A AO 

OOOOO 1 OOAO- 1 1 A 1 OO 1 1 J OOOOOOAAAO 


Xnol 




fu 961-... 


Fwd 


CGCGGATCCCATATG- GCCACAAACGACGAC 


Ndel 


30 




Rev 


CGCGGATCC-CCACTCGTAATTGACGCC 


BamHI 


fu 961-... 


Fwd 


CGCGGATCCCATATG-GCCACAAGCGACGAO 


Ndel 




(MC58) 


Rev 


CGCGGATCC-CCACTCGTAATTGACGCC 


BamHI 




fu 961 c-... 


Fwd 


CGCGGATCCCATATG-GCCACAAACGACGAC 


Ndel 


35 




Rev 


CGCGGATCC -ACCCACGTTGTAAGGTTG 


BamHI 


fu 961 c-L-... 


Fwd 


CGCGGATCCCATATG- ATGAAACACTTTCCATCC 


Ndel 






Rev 


CGCGGATCC -ACCCACGTTGTAAGGTTG 


BamHI 




fu (961 )- 
741(MC58)-His 


Fwd 


CGCGGATCC -GGAGGGGGTGGTGTCG 


BamHI 


40 




Rev 


CCCGCTCGAG-TTGCTTGGCGGCAAGGC 


Xhol 




fu (961 )-983-His 


Fwd 


CGCGGATCC - GGCGGAGGCGGCACTT 


BamHI 






Rev 


CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 


45 


fu (961)- Orf46.1- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


BamHI 




Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 




fu (961 c-L)- 
741(MC58) 


Fwd 


CGCGGATCC -GGAGGGGGTGGTGTCG 


BamHI 






Rev 


CCCGCTCGAG-TTATTGCTTGGCGGCAAG 


Xhol 


50 


fu (961 c-L )-983 


Fwd 


CGCGGATCC - GGCGGAGGCGGCACTT 


BamHI 






Rev 


CCCGCTCGAG-TCAGAACCGGTAGCCTAC 


Xhol 




fu (961c-L)- 
Orf46.1 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


BamHI 


55 




Rev 


CCCGCTCGAG-TTACGTATCATATTTCACGTGC 


Xhol 


961 (His/GST) 


Fwd 


CGCGGATCCCATATG-GCCACAAGCGACGACG 


BamHI-Ndel 



86 



EP 1 790 660 A2 





(MC58) 


Rev 


LLLOl I LUAO-ttAL 1 LG 1 AA 1 1 UALuLL 


Anol 




961 Al-His 


r?, , ,A 

rWd 


LuLUUA t LLLA I A 1 0-utLALAAAHjAtuAL 


Noel 


5 




Rev 


CCCGCTCGAG-TGCTTTGGCGGCAAAGTT 


Xhol 




961a-(His/GST) 


Fwd 


CGCGGATCCCATATG-GCCACAAACGACGAC 


BamHI-Ndel 






Rev 


CCCGCTCGAG-TTTAGCAATATTATCTTTGTTCGTAGC 


Xhol 




961b-(His/GST) 


Fwd 


CGCGGATCCCATATG-AAAGCAAACCGTGCCGA 


BamHI-Ndel 


10 




Rev 


CCCGCTCGAG-CCACTCGTAATTGACGCC 


Xhol 


961 -His/GST GATE 


Fwd 


ggggacaagtttgtacaaaaaagcaggctGCAGCCACAAACGACGACG 
ATGTTAAAAAAGC 


cut Bl 






Rev 


ggggaccactttgtacaagaaagctgggtTTACCACTCGTAATTGACGC 
CGACATGGTAGG 


attB2 


15 


982 


r~.. .A 

Fwd 


GLGGLL A I A 1 (j*ULAuLAAAAuAtu 1 AC AO 1 I 


iNciei 






Rev 


/~* /"^ /— ' /""'/"IT 1 /""' /*> A /"""» ' 1 " ■ ' A A TP /"I A TP/"' f^/""*/^ f~*i~^i~* A TP A /™1 /"S A 

GCGGCCTCGAG-TTACATCATGCCGLLLA I ALLA 


Xhol 




983-His (2996) 


C, . ,A 

rwd 


tuLuuA I ttUL 1 AUt- I 1 AuuLUULUULuuAu 


iNnei 






Rev 


CClGCTlGAG-GAAllGG I AGLL I ALG 


Anol 


20 


AG983-His (2996) 


r„,j 

rwd 


LLLL I AuL I AUL-AL 1 1 Li OtutttUAL I I 


iNnei 




Rev 


CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 




983-His 


Fwd 


CGCGGATCCGCTAGC-TTAGGCGGCGGCGGAG 


Nhel 






Rev 


CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 


25 


AG983-His 


Fwd 


CGCGGATCCGCTAGC-ACTTCTGCGCCCGACTT 


Nhel 




Rev 


CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 




983L 


Fwd 


CGCGGATCCGCTAGC- 

CGAACGACCCCAACCTTCCCTACAAAAACTTTCAA 


Nhel 






Rev 


LLLOL I LGAG- 1 LAGAALLGALG I (jLLAAvjCLu 1 1 L 


XnOl 


30 


987-His (1VIC58) 


Fwd 


ULtOLtA 1 A I uLttttAL 1 uOAAuAAtuuALO 


Noei 






Rev 


fT^ A O T A ATA A A /~ , f~ , T _ T"/~' A TP/" 1 i^ 1 /~* A /~ > /~' A 

ULtOLL i LuAu 1 AA I AAAtt I J CI A 1 uuutAutAu 


VI,aI 

Anol 




989-(His/G!ST) 


Fwd 


LuLuuA I LttA 1 A 1 O- 1 Ltu 1 LLALULA 1 Ltvj 


DairiMi-iNQei 




(MC58) 


Rev 


LLLOL 1 LUAO- 1 J 1 GAA I 1 1 G 1 AUO lulAl lU 


Anol 


35 


989L 


Fwd 


LOLUu A I LLLA 1 A 1 0-ALLLL J I LLULAL 1 


Ndel 




(MC58) 


Rev 


CCCGCTCG AG-TTATT1 GAA J I I G I AGO 1 G J A I 


Anol 




CrgA-His 


C, , ,A 

rwd 


t~*/~* r^r^ a tppp a t a t/^* a a a a c^f^ a a i 'it 1 a n a a a a 
LuLOuA 1 LLLA 1 A 1 0-AAAALLAA J 1 LAuAAuAA 


rsaei 




(MC58) 


Rev 


LLLGl I LGAG- I LLAlAGAGA 1 I G I 1 1 LL 


Anol 


40 


i>: i it c 
rilLl-ko 


Fwd 


OA 1 OLLLO AAOOOLOOO 






(MC58) 


Rev 


r^> (~* A A riTT TP/"" 1 A/^A A / ^ A AOA f""' , 'l""I"/"' A i~^f~^ f" 1 

OLllAAOL 1 I - 1 LAOAAOAAOAL 1 1 LALOL 






rllLl-rilS 


Fwd 


mf^dC ATrrPATATH PA A Appr ATA A ATATflPTATT 
LOLOO A 1 LLLA 1 A 1 Vj-L/\r\ALLL/Y 1 AAA 1 ALOL 1 A 1 I 


iNaei 




(MC58) 


Rev 


/-> (~*r^{~* A A /^OT'T A A/^A A/^A PTTP A r^f^C^f^ A 

OLLLAAOL 1 1-OAAOAAOAL1 1LALOLLAO 


Hindin 


45 


AlPiICl-His 


rWd 


c s r\c y r\r\ a tppp ata r.TPTTTTTrr: a r a ATArrnA 

LOLOO A 1 LLLA i A 1 O-O 1 L 1 1 1 i 1 LOALAA 1 ALLOA 


iNaei 




(MC58) 


Rev 


OLLLAAOL 1 i - 


Minalli 




r llLl L 


r wq 


ppprp, ATPrPATATH A ATA A A APTTTA AAA AP,r.rr,r. 
LULUUA 1 LLLA 1 A 1 u-AA 1 AAAAL 111 AAAAAUULUU 


IN del 




(IVIlSo) 


Rev 


C^C^Cr* A A r.PTT TP AT.A AHA AHA PTTP A CCIC 1 
UULLAAUL 11-1 LAUAAUAAUAt 1 1 LALUL 


rtinaiii 


50 


ALr i DpZ-1 1 IS 


Fwd 


CGCG A ATCCC AT ATG-TTPCi ATCTTGATTCTCiTCCi A 


Ndel 


(MC58) 


Rev 


CCCGCTCGAG-TCGCACAGGCTGTTGGCG 


Xhol 




Tbp2-His 


Fwd 


CGCGAATCCCATATG-TTGGGCGGAGGCGGCAG 


Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TCGCACAGGCTGTTGGCG 


Xhol 


55 


Tbp2-His(MC58) 


Fwd 


CGCGAATCCCATATG-TTGGGCGGAGGCGGCAG 


Ndel 




Rev 


CCCGCTCGAG-TCGCACAGGCTGTTGGCG 


Xhol 



87 



EP 1 790 660 A2 





NMB0109- 
(His/GST) 


Fwd 


CGCGGATCCCATATG-GCAAATTTGGAGGTGCGC 


BamHI-Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTCGGAGCGGTTGAAGC 


Xhol 


5 


NMB01Q9L 


Fwd 


CGCGGATCCCATATG-CAACGTCGTATTATAACCC 


Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTATTCGGAGCGGTTGAAG 


Xhol 




NMB0207- 
(His/GST) 


Fwd 


CGCGGATCCCATATG- 
GGCATCAAAGTCGCCATCAACGGCTAC 


BamHI-Ndel 


10 


(MC58) 


Rev 


CCCGCTCGAG-TTTGAGCGGGCGCACTTCAAGTCCG 


Xhol 




NMB0462- 
(H is/GST) 


Fwd 


CGCGGATCCCATATG-GGCGGCAGCGAAAAAAAC 


BamHI-Ndel 




(MC58) 


Rev 


CCCGCTCGAG-GTTGGTGCCGACTTTGAT 


Xhol 


15 


NMB0623- 
(H is/GST) 


Fwd 


CGCGGATCCCATATG-GGCGGCGGAAGCGATA 


BamHI-Ndel 




(MC58) 


Rev 


CCCGCTCGAG-TTTGCCCGCTTTGAGCC 


Xhol 




GST)(MC58) 


Fwd 




Damrii-iNaei 


20 




Rev 


CCCGCTCGAGCATCCCGTACTGTTTCG 


Xhol 




NMB0634 
(His/GST)(IV1C58) 


Fwd 


ggggacaagtttgtacaaaaaagcaggctCCGACATTACCGTGTACAAC 
GGCCAACAAAGAA 


atrBl 






Rev 


ggggaccactttgtacaagaaagctgggtCTTATTTCATACCGGCTTGCT 
CAAGCAGCCGG 


auB2 


25 


NMB0776- 
His/GST (MC58) 

GATE 


Fwd 
Rev 


ggggacaagtttgtacaaaaaagcaggctGATACGGTGTTTTCCTGTAA 
AACGGACAACAA 

ggggaccactttgtacaagaaagctgggtCTAGGAAAAATCGTCATCGT 
TGAAATTCGCC 


atrBl 
attB2 


30 


NMB1115- 
His/GST (MC58) 


Fwd 
Rev 


ggggacaagtttgtacaaaaaagcaggctATGCACCCCATCGAAACC 
ggggaccactttgtacaagaaagctgggtCTAGTCTTGCAGTGCCTC 


attBI 
attB2 




NMB1343- 
(His/GST) 


Fwd 


CGCGGATCCCATATG- 
GGAAATTTCTTATATAGAGGCATTAG 


BamHI-Ndel 


35 


(MC58) 


Rev 


CCCGCTCGAG- 

GTTAATTTCTATCAACTCTTTAGCAATAAT 


Xhol 




NMB1369 (His- 
GST (MC58) 


Fwd 


CGCGGATCCCATATGGCCTGCCAAGACGACA 


BamHI-Ndel 






Rev 


CCCGCTCGAGCCGCCTCCTGCCGAAA 


Xhol 


40 


NMB1551 (His- 
GST)(MC58) 


Fwd 


CGCGGATCCCATATGGCAGAGATCTGTTTGATAA 


BamHI-Ndel 






Rev 


CCCGCTCGAGCGGTTTTCCGCCCAATG 


Xhol 




NMB1899 (His- 
GST) (MC58) 


Fwd 


CGCGGATCCCATATGCAGCCGGATACGGTC 


BamHI-Ndel 


45 




Rev 


CCCGCTCGAGAATCACTTCCAACACAAAAT 


Xhol 


NMB2050- 
(His/GST) 


Fwd 


CGCGGATCCCATATG-TGGTTGCTGATGAAGGGC 


BamHI-Ndel 




(MC58) 


Rev 


CCCGCTCGAG-GACTGCTTCATCTTCTGC 


Xhol 




NMB2050L 


Fwd 


CGCGGATCCCATATG-GAACTGATGACTGTTTTGC 


Ndel 


50 


(MC58) 


Rev 


CCCGCTCGAG-TCAGACTGCTTCATCTTCT 


Xhol 




NMB2159- 
(His/GST) 


Fwd 


CGCGGATCCCATATG- 

AGCATTAAAGTAGCGATTAACGGTTTCGGC 


BamHI-Ndel 


55 


(MC58) 


Rev 


CCCGCTCGAG- 

GATTTTGCCTGCGAAGTATTCCAAAGTGCG 


Xhol 


fu-AG287...-His 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 
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Rev 


CGGGGATCC-ATCCTGCTCTTTTTTGCCGG 


BamHI 




fu-(AG287)-919- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
CAAAGCAAGAGCATCCAAACC 


BamHI 


5 




Rev 


CCCAAGCTT-TTCGGGCGGTATTCGGGCTTC 


Hindlll 




fu-(AG287)-953- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
GCCACCTACAAAGTGGAC 


BamHI 






Rev 


GCCCAAGCTT-TTGTTTGGCTGCCTCGAT 


Hindlll 


10 


fu-(AG2S7)-961- 


Fwd 


CGCGGATCCGGTGGTGGTGGT-ACAAGCGACGACG 


BamHI 




His 


Rev 


GCCCAAGCTT-CCACTCGTAATTGACGCC 


Hindlll 




fu-(AG287)- 
Orf46.1-His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


BamHI 


15 




Rev 


CCCAAGCTT-CGTATCATATTTCACGTGC 


Hindlll 


fu-(AG287-919)- 
Orf46.1-His 


Fwd 


CCCAAGCTTGGTGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


Hindlll 






Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 


20 


fu-(AG287- 
Orf46.1)-919-His 


Fwd 


CCCAAGCTTGGTGGTGGTGGTGGT- 
CAAAGCAAGAGCATCCAAACC 


Hindlll 




Rev 


CCCGCTCGAG-CGGGCGGTATTCGGGCTT 


Xhol 




fu AG287(394.98)- 


Fwd 


CGCGGATCCGCTAGC-CCCGATGTTAAATCGGC 


Nhel 


25 




Rev 


CGGGGATCC-ATCCTGCTCTTTTTTGCCGG 


BamHI 


fu Orfl-(Orf46.1)- 


Fwd 


CGCGGATCCGCTAGC-GGACACACTTATTTCGGCATC 


Nhel 




His 


Rev 


CGCGGATCC-CCAGCGGTAGCCTAATTTGAT 






fu (Orfl)-Orf46.1- 
His 


Fwd 


CGCGGATCCGGTGGTGGTGGT- 
TCAGATTTGGCAAACGATTC 


BamHI 


30 




Rev 


ClLAAOL I 1 -CO 1 A 1 LA 1 A 1 1 1 LALu 1 UL 


Hindlll 




fu (919)-Orf46.1- 


Fwdl 


GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAG 


Sail 




His 


Fwd2 


uCjACjOCAC I CjCjA I CC 1 CACjA 1 1 J uOLAAALuA 1 I C 








Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 


35 


Fu orf46-.... 


Fwd 


GGAATTCCATATGTCAGATTTGGCAAACGATTC 


Ndel 






Rev 


CGCGGATCCCGTATCATATTTCACGTGC 


BamHI 




Fu (orf46)-287-His 


Fwd 


CGGGGATCCGGGGGCGGCGGTGGCG 


BamHI 






Rev 


CCCAAGCTTATCCTGCTCTTTTTTGCCGGC 


Hindlll 


40 


Fu (orf46)-919-His 


Fwd 


CGCGGATCCGGTGGTGGTGGTCAAAGCAAGAGCATCCA 
AACC 


BamHI 






Rev 


CCCAAGCTTCGGGCGGTATTCGGGCTTC 


Hindlll 




Fu (orf46-919)- 
287-His 


Fwd 


CCCCAAGCTTGGGGGCGGCGGTGGCG 


Hindlll 


45 




Rev 


CCCGCTCGAGATCCTGCTCTTTTTTGCCGGC 


Xhol 




Fu (orf46-287)- 
919-His 


Fwd 


CCCAAGCTTGGTGGTGGTGGTGGTCAAAGCAAGAGCAT 
CCAAACC 


Hindlll 






Rev 


CCCGCTCGAGCGGGCGGTATTCGGGCTT 


Xhol 


50 


(AG741 )-961c-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 






Rev 


CCCGCTCGAG-ACCCAGCTTGTAAGGTTG 


Xhol 




(AG741 )-961-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 


55 




Rev 


CCCGCTCGAG-CCACTCGTAATTGACGCC 


Xhol 
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(AG741 )-983-His 


Fwd 


GCGGCCTCGAG- 

GGATCCGGCGGAGGCGGCACTTCTGCG 


Xhol 


Rev 


CCCGCTCGAG-GAACCGGTAGCCTACG 


Xhol 


(AG741 )-orf46.1- 
His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC 
GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA 


Sail 


Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 


(AG983)- 
741(MC58)-His 


Fwd 


GCGGCCTCGAG-GGATCCGGAGGGGGTGGTGTCGCC 


Xhol 


Rev 


CCCGCTCGAG-TTGCTTGGCGGCAAG 


Xhol 


(AG983)-961c-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 




Rev 


CCCGCTCGAG-ACCCAGCTTGTAAGGTTG 


Xhol 


(AG983)-961-His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCGCAGCCACAAACGACGACGA 
GCGGCCTCGAG-GGTGGCGGAGGCACTGGATCCGCAG 


Xhol 


Rev 


CCCGCTCGAG-CCACTCGTAATTGACGCC 


Xhol 


(AG983)- Orf46.1- 
His 


Fwdl 
Fwd2 


GGAGGCACTGGATCCTCAGATTTGGCAAACGATTC 
GCGGCGTCGACGGTGGCGGAGGCACTGGATCCTCAGA 


Sail 


Rev 


CCCGCTCGAG-CGTATCATATTTCACGTGC 


Xhol 



* This primer was used as a Reverse primer for all the C terminal fusions of 287 to the His-tag. 

§ Forward primers used in combination with the 287-His Reverse primer. 
NB - All PCR reactions use strain 2996 unless otherwise specified (e.g. strain MC58) 



[0293] In all constructs starting with an ATG not followed by a unique Nhe\ site, the ATG codon is part of the A/del 
site used for cloning. The constructs made using Nhe\ as a cloning site at the 5' end (e.g. all those containing 287 at 
the N-terminus) have two additional codons (GCT AGC) fused to the coding sequence of the antigen. 

Preparation of chromosomal DNA templates 

[0294] N. meningitidis strains 2996, MC58, 394.98, 1000 and BZ232 (and others) were grown to exponential phase 
in 1 00ml of GC medium, harvested by centrifugation, and resuspended in 5ml buffer (20% w/v sucrose, 50mM Tris-HCI, 
50mM EDTA, pH8). After 1 0 minutes incubation on ice, the bacteria were lysed by adding 1 0ml of lysis solution (50mM 
NaCI, 1% Na-Sarkosyl, 50|jig/ml Proteinase K), and the suspension incubated at 37°C for 2 hours. Two phenol extractions 
(equilibrated to pH 8) and one CHCI 3 /isoamylalcohol (24:1 ) extraction were performed. DNA was precipitated by addition 
of 0.3M sodium acetate and 2 volumes of ethanol, and collected by centrifugation. The pellet was washed once with 
70%(v/v) ethanol and redissolved in 4.0ml TE buffer (10mM Tris-HCI, 1 mM EDTA, pH 8.0). The DNA concentration was 
measured by reading OD 260 . 

PCR Amplification 

[0295] The standard PCR protocol was as follows: 200ng of genomic DNA from 2996, MC581 000, or BZ232 strains 
or 10ng of plasmid DNA preparation of recombinant clones were used as template in the presence of 40jjlM of each 
oligonucletide primer, 400-800 |jlM dNTPs solution, 1x PCR buffer (including 1 .5mM MgCI 2 ), 2.5 units Taql DNA polymer- 
ase (using Perkin-Elmer AmpliTaQ, Boerhingher Mannheim Expand™ Long Template). 

[0296] After a preliminary 3 minute incubation of the whole mix at 95°C, each sample underwent a two-step amplifi- 
cation: the first 5 cycles were performed using the hybridisation temperature that excluded the restriction enzyme tail of 
the primer (T m1 ). This was followed by 30 cycles according to the hybridisation temperature calculated for the whole 
length oligos (T m2 ). Elongation times, performed at 68°C or 72°C, varied according to the length of the Orf to be amplified. 
In the case of Orf1 the elongation time, starting from 3 minutes, was increased by 15 seconds each cycle. The cycles 
were completed with a 10 minute extension step at 72°C. 

[0297] The amplified DNA was either loaded directly on a 1% agarose gel. The DNA fragment corresponding to the 
band of correct size was purified from the gel using the Qiagen Gel Extraction Kit, following the manufacturer's protocol. 
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Digestion of PCR fragments and of the cloning vectors 

[0298] The purified DNA corresponding to the amplified fragment was digested with the appropriate restriction enzymes 
for cloning into pET-21 b+, pET22b+ or pET-24b+. Digested fragments were purified using the QIAquick PCR purification 
5 kit (following the manufacturer's instructions) and eluted with either H 2 0 or 10mM Tris, pH 8.5. Plasmid vectors were 
digested with the appropriate restriction enzymes, loaded onto a 1 .0% agarose gel and the band corresponding to the 
digested vector purified using the Qiagen QIAquick Gel Extraction Kit. 

Cloning 

10 

[0299] The fragments corresponding to each gene, previously digested and purified, were ligated into pET21 b+, 
pET22b+ or pET-24b+. A molar ratio of 3:1 fragment/vector was used with T4 DNA ligase in the ligation buffer supplied 
by the manufacturer. 

[0300] Recombinant plasmid was transformed into competent E.coli DH5 or HB1 01 by incubating the ligase reaction 
15 solution and bacteria for 40 minutes on ice, then at 37°C for 3 minutes. 

[0301] This was followed by the addition of 800|jlI LB broth and incubation at 37°C for 20 minutes. The cells were 
centrifuged at maximum speed in an Eppendorf microfuge, resuspended in approximately 200jxl of the supernatant and 
plated onto LB ampicillin (100mg/ml) agar. 

[0302] Screening for recombinant clones was performed by growing randomly selected colonies overnight at 37°C in 
20 4.0ml of LB broth + 1 0O^g/ml ampicillin. Cells were pelleted and plasmid DNA extracted using the Qiagen QIAprep Spin 
Miniprep Kit, following the manufacturer's instructions. Approximately 1 jxg of each individual miniprep was digested with 
the appropriate restriction enzymes and the digest loaded onto a 1 -1 .5% agarose gel (depending on the expected insert 
size), in parallel with the molecular weight marker (1kb DNA Ladder, GIBCO). Positive clones were selected on the 
basis of the size of insert. 

25 

Expression 

[0303] After cloning each gene into the expression vector, recombinant plasmids were transformed into E.coli strains 
suitable for expression of the recombinant protein. VI of each construct was used to transform E.coli BL21-DE3 as 

30 described above. Single recombinantcolonies were inoculated into2ml LB+Amp (1 00|xg/ml), incubated at37°C overnight, 
then diluted 1 :30 in 20ml of LB+Amp (1 OOjxg/ml) in 1 00ml flasks, to give an OD 600 between 0.1 and 0.2. The flasks were 
incubated at 30°C or at 37°C in a gyratory water bath shaker until OD 600 indicated exponential growth suitable for 
induction of expression (0.4-0.8 OD). Protein expression was induced by addition of 1 .OmM IPTG. After 3 hours incubation 
at 30°C or 37°C the OD 600 was measured and expression examined. 1 .0ml of each sample was centrifuged in a microfuge, 

35 the pellet resuspended in PBS and analysed by SDS-PAGE and Coomassie Blue staining. 

Gateway cloning and expression 

[0304] Sequences labelled GATE were cloned and expressed using the GATEWAY Cloning Technology (GIBCO- 
40 BRL). Recombinational cloning (RC) is based on the recombination reactions that mediate the integration and excision 
of phage into and from the E.coli genome, respectively. The integration involves recombination of the attP site of the 
phage DNA within the affBsite located in the bacterial genome (BP reaction) and generates an integrated phage genome 
flanked by attL and attR sites. The excision recombines attL and attR sites back to attP and attB sites (LR reaction). 
The integration reaction requires two enzymes [the phage protein Integrase (Int) and the bacterial protein integration 
45 host factor (IHF)] (BP clonase). The excision reaction requires Int, IHF, and an additional phage enzyme, Excisionase 
(Xis) (LR clonase). Artificial derivatives of the 25-bp bacterial attB recombination site, referred to as B 1 and B2, were 
added to the 5' end of the primers used in PCR reactions to amplify Neisserial ORFs. The resulting products were BP 
cloned into a "Donor vector" containing complementary derivatives of the phage attP recombination site (P1 and P2) 
using BP clonase. The resulting "Entry clones" contain ORFs flanked by derivatives of the attL site (LI and L2) and were 
50 subcloned into expression "destination vectors" which contain derivatives of the attL -compatible attR sites (R1 and R2) 
using LR clonase. This resulted in "expression clones" in which ORFs are flanked by B1 and B2 and fused in frame to 
the GST or His N terminal tags. 

[0305] The E. coli strain used for GATEWAY expression is BL21 -SI. Cells of this strain are induced for expression of 
the T7 RNA polymerase by growth in medium containing salt (0.3 M NaCI). 
55 [0306] Note that this system gives N-terminus His tags. 
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Preparation of membrane proteins. 

[0307] Fractions composed principally of either inner, outer or total membrane were isolated in order to obtain recom- 
binant proteins expressed with membrane-localisation leader sequences. The method for preparation of membrane 

5 fractions, enriched for recombinant proteins, was adapted from Filip et. al. [J.Bact. (1973) 115:717-722] and Davies et. 
al. [J. Immunol. Meth. (1990) 143:215-225]. Single colonies harbouring the plasmid of interest were grown overnight at 
37°C in 20 ml of LB/Amp (100 ^xg/ml) liquid culture. Bacteria were diluted 1:30 in 1 .0 L of fresh medium and grown at 
either 30°C or 37°C until the OD 550 reached 0.6-0.8. Expression of recombinant protein was induced with I PTG at a final 
concentration of 1 .0 mM. After incubation for 3 hours, bacteria were harvested by centrifugation at8000gfor 15 minutes 

10 at 4°C and resuspended in 20 ml of 20 mM Tris-HCI (pH 7.5) and complete protease inhibitors (Boehringer-Mannheim). 
All subsequent procedures were performed at 4°C or on ice. 

[0308] Cells were disrupted by sonication using a Branson Sonifier450 and centrifuged at5000g for 20 min to sediment 
unbroken cells and inclusion bodies. The supernatant, containing membranes and cellular debris, was centrifuged at 
50000g (Beckman Ti50, 29000rpm) for 75 min, washed with 20 mM Bis-tris propane (pH 6.5), 1 .0 M NaCI, 10% (v/v) 

15 glycerol and sedimented again at 50000g for 75 minutes. The pellet was resuspended in 20mM Tris-HCI (pH 7.5), 2.0% 
(v/v) Sarkosyl, complete protease inhibitor (1 .0 mM EDTA, final concentration) and incubated for 20 minutes to dissolve 
inner membrane. Cellular debris was pelleted by centrifugation at 5000g for 1 0 min and the supernatant centrifuged at 
75000g for 75 minutes (Beckman Ti50, 33000rpm). Proteins 008L and 51 9L were found in the supernatant suggesting 
inner membrane localisation. For these proteins both inner and total membrane fractions (washed with NaCI as above) 

20 were used to immunise mice. Outer membrane vesicles obtained from the 75000g pellet were washed with 20 mM Tris- 
HCI (pH 7.5) and centrifuged at 75000g for 75 minutes or overnight. The OMV was finally resuspended in 500 jjlI of 20 
mM Tris-HCI (pH 7.5), 10% v/v glycerol. OrfIL and Orf40L were both localised and enriched in the outer membrane 
fraction which was used to immunise mice. Protein concentration was estimated by standard Bradford Assay (Bio-Rad), 
while protein concentration of inner membrane fraction was determined with the DC protein assay (Bio-Rad). Various 

25 fractions from the isolation procedure were assayed by SDS-PAGE. 

Purification of His-tagged proteins 

[0309] Various forms of 287 were cloned from strains 2996 and MC58. They were constructed with a C-terminus His- 

30 tagged fusion and included a mature form (aa 18-427), constructs with deletions (A1, A 2, A3 and A4) and clones 
composed of either B or C domains. For each clone purified as a His-fusion, a single colony was streaked and grown 
overnight at 37°C on a LB/Amp (100 |xg/ml) agar plate. An isolated colony from this plate was inoculated into 20ml of 
LB/Amp (100 |jig/ml) liquid medium and grown overnight at 37°C with shaking. The overnight culture was diluted 1 :30 
into 1 .0 L LB/Amp (1 00 jjig/ml) liquid medium and allowed to grow at the optimal temperature (30 or 37°C) until the OD 550 

35 reached 0.6-0.8. Expression of recombinant protein was induced by addition of IPTG (final concentration 1 .OmM) and 
the culture incubated for a further 3 hours. Bacteria were harvested by centrifugation at 8000g for 15 min at 4°C. The 
bacterial pellet was resuspended in 7.5 ml of either (i) cold buffer A (300 mM NaCI, 50 mM phosphate buffer, 10 mM 
imidazole, pH 8.0) for soluble proteins or (ii) buffer B (10mM Tris-HCI, 100 mM phosphate buffer, pH 8.8 and, optionally, 
8M urea) for insoluble proteins. Proteins purified in a soluble form included 287-His, A1 , A2, A3 and A4287-His, 

40 A4287MC58-His, 287c-His and 287cMC58-His. Protein 287bMC58-His was insoluble and purified accordingly. Cells 
were disrupted by sonication on ice four times for 30 sec at 40W using a Branson sonifier 450 and centrifuged at 1 3000xg 
for 30 min at 4°C. For insoluble proteins, pellets were resuspended in 2.0 ml buffer C (6 M guanidine hydrochloride, 1 00 
mM phosphate buffer, 1 0 mM Tris- HCI, pH 7.5 and treated with 1 0 passes of a Dounce homogenizer. The homogenate 
was centrifuged at 13000g for 30 min and the supernatant retained. Supernatants for both soluble and insoluble prep- 

45 arations were mixed with 150jxl Ni 2+ -resin (previously equilibrated with either buffer A or buffer B, as appropriate) and 
incubated at room temperature with gentle agitation for 30 min. The resin was Chelating Sepharose Fast Flow (Phar- 
macia), prepared according to the manufacturer's protocol. The batch-wise preparation was centrifuged at 700g for 5 
min at 4°C and the supernatant discarded. The resin was washed twice (batch-wise) with 10ml buffer A or B for 1 0 min, 
resuspended in 1 .0 ml buffer A or B and loaded onto a disposable column. The resin continued to be washed with either 

50 (j) buffer A at 4°C or (ii) buffer B at room temperature, until the OD 280 of the flow-through reached 0.02-0.01 . The resin 
was further washed with either (i) cold buffer C (300mM NaCI, 50mM phosphate buffer, 20mM imidazole, pH 8.0) or (ii) 
buffer D (10mM Tris-HCI, 100mM phosphate buffer, pH 6.3 and, optionally, 8M urea) until OD 280 of the flow-through 
reached 0.02-0.01. The His-fusion protein was eluted by addition of 700|xl of either (i) cold elution buffer A (300 mM 
NaCI, 50mM phosphate buffer, 250 mM imidazole, pH 8.0) or (ii) elution buffer B (10 mM Tris-HCI, 100 mM phosphate 

55 buffer, pH 4.5 and, optionally, 8M urea) and fractions collected until the OD 280 indicated all the recombinant protein was 
obtained. 20jxl aliquots of each elution fraction were analysed by SDS-PAGE. Protein concentrations were estimated 
using the Bradford assay. 
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Renaturation of denatured His-fusion proteins. 

[0310] Denaturation was required to solubilize 287bMC8, so a renaturation step was employed prior to immunisation. 
Glycerol was added to the denatured fractions obtained above to give a final concentration of 10% v/v. The proteins 
5 were diluted to 200 |jig/ml using dialysis buffer I (10% v/v glycerol, 0.5M arginine, 50 mM phosphate buffer, 5.0 mM 
reduced glutathione, 0.5 mM oxidised glutathione, 2.0M urea, pH 8.8) and dialysed against the same buffer for 12-14 
hours at 4°C. Further dialysis was performed with buffer II (10% v/v glycerol, 0.5M arginine, 50mM phosphate buffer, 
5.0mM reduced glutathione, 0.5mM oxidised glutathione, pH 8.8) for 12-14 hours at 4°C. Protein concentration was 
estimated using the formula: 

10 

Protein (mg/ml) = (L 55 x OD 28 o) - (0. 76 x OD 26 o) 

1 5 A mi no acid sequence analysis. 

[0311] Automated sequence analysis of the NH 2 -terminus of proteins was performed on a Beckman sequencer (LF 
3000) equipped with an on-line phenylthiohydantoin-amino acid analyser (System Gold) according to the manufacturer's 
recommendations. 

20 

Immunization 

[0312] Balb/C mice were immunized with antigens on days 0, 21 and 35 and sera analyzed at day 49. 

25 Sera analysis - ELISA 

[0313] The acapsulated MenB M7 and the capsulated strains were plated on chocolate agar plates and incubated 
overnight at 37°C with 5% C0 2 . Bacterial colonies were collected from the agar plates using a sterile dracon swab and 
inoculated into Mueller-Hinton Broth (Difco) containing 0.25% glucose. Bacterial growth was monitored every 30 minutes 

30 by following OD 620 - The bacteria were let to grow until the OD reached the value of 0.4-0.5. The culture was centrifuged 
for 1 0 minutes at 4000rpm. The supernatant was discarded and bacteria were washed twice with PBS, resuspended in 
PBS containing 0.025% formaldehyde, and incubated for 1 hour at 37°C and then overnight at 4°C with stirring. 100jxl 
bacterial cells were added to each well of a 96 well Greiner plate and incubated overnight at 4°C. The wells were then 
washed three times with PBT washing buffer (0.1% Tween-20 in PBS). 200jxl of saturation buffer (2.7% polyvinylpyrro- 

35 lidone 1 0 in water) was added to each well and the plates incubated for 2 hours at 37°C. Wells were washed three times 
with PBT. 200|xl of diluted sera (Dilution buffer: 1% BSA, 0.1 % Tween-20, 0.1 % NaN 3 in PBS) were added to each 
well and the plates incubated for 2 hours at 37°C. Wells were washed three times with PBT. 1 OOjjlI of HRP-conjugated 
rabbit anti-mouse (Dako) serum diluted 1 :2000 in dilution buffer were added to each well and the plates were incubated 
for 90 minutes at 37°C. Wells were washed three times with PBT buffer. 1 OOjjlI of substrate buffer for HRP (25ml of 

40 citrate buffer pH5, 1 0mg of O-phenildiamine and 1 0jxl of H 2 0 2 ) were added to each well and the plates were left at room 
temperature for 20 minutes. 1 OOjjlI 12.5% H 2 S0 4 was added to each well and OD 490 was followed. The ELISA titers 
were calculated abitrarely as the dilution of sera which gave an OD 490 value of 0.4 above the level of preimmune sera. 
The ELISA was considered positive when the dilution of sera with OD 490 of 0.4 was higher than 1 :400. 

45 Sera analysis - FACS Scan bacteria binding assay 

[0314] The acapsulated MenB M7 strain was plated on chocolate agar plates and incubated overnight at 37°C with 
5% C0 2 . Bacterial colonies were collected from the agar plates using a sterile dracon swab and inoculated into 4 tubes 
containing 8ml each Mueller-Hinton Broth (Difco) containing 0.25% glucose. Bacterial growth was monitored every 30 

50 minutes by following OD 620 . The bacteria were let to grow until the OD reached the value of 0.35-0.5. The culture was 
centrifuged for 10 minutes at 4000rpm. The supernatant was discarded and the pellet was resuspended in blocking 
buffer (1% BSA in PBS, 0.4% NaN 3 ) and centrifuged for 5 minutes at 4000rpm. Cells were resuspended in blocking 
buffer to reach OD 620 of 0.05. 1 OOjjlI bacterial cells were added to each well of a Costar 96 well plate. 1 OOjjlI of diluted 
(1 :1 00, 1 :200, 1 :400) sera (in blocking buffer) were added to each well and plates incubated for 2 hours at 4°C. Cells 

55 were centrifuged for 5 minutes at 4000rpm, the supernatant aspirated and cells washed by addition of 200jjJ/well of 
blocking buffer in each well. 1 0Ojxl of R-Phicoerytrin conjugated F(ab) 2 goat anti-mouse, diluted 1 :100, was added to 
each well and plates incubated for 1 hour at 4°C. Cells were spun down by centrifugation at 4000rpm for 5 minutes and 
washed by addition of 200jxl/well of blocking buffer. The supernatant was aspirated and cells resuspended in 200jxl/ 
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well of PBS, 0.25% formaldehyde. Samples were transferred to FACScan tubes and read. The condition for FACScan 
(Laser Power 15mW) setting were: FL2 on; FSC-H threshold:92; FSC PMT Voltage: E 01 ; SSC PMT: 474; Amp. Gains 
6.1; FL-2 PMT: 586; compensation values: 0. 

5 Sera analysis - bactericidal assay 

[0315] N. meningitidis strain 2996 was grown overnight at37°C on chocolate agar plates (starting from a frozen stock) 
with 5% C<0 2 . Colonies were collected and used to inoculate 7ml Mueller-Hinton broth, containing 0.25% glucose to 
reach an OD 620 of 0.05-0.08. The culture was incubated for approximately 1 .5 hours at 37 degrees with shacking until 
10 the OD 620 reached the value of 0.23-0.24. Bacteria were diluted in 50mM Phosphate buffer pH 7.2 containing 10mM 
MgCI 2 , 10mM CaCI 2 and 0.5% (w/v) BSA (assay buffer) at the working dilution of 10 5 CFU/ml. The total volume of the 
final reaction mixture was 50 \x\ with 25 jxl of serial two fold dilution of test serum, 12.5 ptl of bacteria at the working 
dilution, 12.5 jxl of baby rabbit complement (final concentration 25%). 

[0316] Controls included bacteria incubated with complement serum, immune sera incubated with bacteria and with 
15 complement inactivated by heating at 56°C for 30'. Immediately after the addition of the baby rabbit complement, 1 0jxl 
of the controls were plated on Mueller-Hinton agar plates using the tilt method (time 0). The 96-wells plate was incubated 
for 1 hour at 37°C with rotation. 7 jxl of each sample were plated on Mueller-Hinton agar plates as spots, whereas 10|xl 
of the controls were plated on Mueller-Hinton agar plates using the tilt method (time 1). Agar plates were incubated for 
18 hours at 37 degrees and the colonies corresponding to time 0 and time 1 were counted. 

20 

Sera analysis - western blots 

[031 7] Purified proteins (500ng/lane), outer membrane vesicles (5jxg) and total cell extracts (25jxg) derived from MenB 
strain 2996 were loaded onto a 1 2% SDS-polyacrylamide gel and transferred to a nitrocellulose membrane. The transfer 

25 was performed for 2 hours at 150mA at 4°C, using transfer buffer (0.3% Tris base, 1 .44% glycine, 20% (v/v) methanol). 
The membrane was saturated by overnight incubation at 4°C in saturation buffer (1 0% skimmed milk, 0.1 % Triton X1 00 
in PBS). The membrane was washed twice with washing buffer (3% skimmed milk, 0.1% Triton X100 in PBS) and 
incubated for 2 hours at 37°C with mice sera diluted 1 :200 in washing buffer. The membrane was washed twice and 
incubated for 90 minutes with a 1 :2000 dilution of horseradish peroxidase labelled anti-mouse Ig. The membrane was 

30 washed twice with 0.1% Triton X100 in PBS and developed with the Opti-4CN Substrate Kit (Bio-Rad). The reaction 
was stopped by adding water. 

[0318] The OMVs were prepared as follows: N. meningitidis strain 2996 was grown overnight at 37 degrees with 5% 
C0 2 on 5 GC plates, harvested with a loop and resuspended in 10 ml of 20mM Tris-HCI pH 7.5, 2 mM EDTA. Heat 
inactivation was performed at 56°C for 45 minutes and the bacteria disrupted by sonication for 5 minutes on ice (50% 

35 duty cycle, 50% output, Branson sonifier 3 mm microtip). Unbroken cells were removed by centrifugation at 5000g for 
10 minutes, the supernatant containing the total cell envelope fraction recovered and further centrifuged overnight at 
50000g at the temperature of 4°C . The pellet containing the membranes was resuspended in 2% sarkosyl, 20mM Tris- 
HCI pH 7.5, 2 mM EDTA and incubated at room temperature for 20 minutes to solubilise the inner membranes. The 
suspension was centrifuged at 10000g for 10 minutes to remove aggregates, the supernatant was further centrifuged 

40 at 50000g for 3 hours. The pellet, containing the outer membranes was washed in PBS and resuspended in the same 
buffer. Protein concentration was measured by the D.C. Bio-Rad Protein assay (Modified Lowry method), using BSA 
as a standard. 

[0319] Total cell extracts were prepared as follows: N. meningitidis strain 2996 was grown overnight on a GC plate, 
harvested with a loop and resuspended in 1 ml of 20mM Tris-HCI. Heat inactivation was performed at 56°C for 30 minutes. 

45 

961 domain studies 

[0320] Cellular fractions preparation Total lysate, periplasm, supernatant and OMV of F.co//clones expressing different 
domains of 961 were prepared using bacteria from over-night cultures or after 3 hours induction with IPTG. Briefly, the 

50 periplasm were obtained suspending bacteria in saccarose 25% and Tris 50mM (pH 8) with polimixine 100[xg/ml. After 
1 hr at room temperature bacteria were centrifuged at 13000rpm for 15 min and the supernatant were collected. The 
culture supernatant were filtered with 0.2jxm and precipitated with TCA 50% in ice for two hours. After centrifugation 
(30 min at 13000 rp) pellets were rinsed twice with ethanol 70% and suspended in PBS. The OMV preparation was 
performed as previously described. Each cellular fraction were analyzed in SDS-PAGE or in Western Blot using the 

55 polyclonal anti-serum raised against GST-961 . 

[0321] Adhesion assay Chang epithelial cells (Wong-Kilbourne derivative, clone 1-5c-4, human conjunctiva) were 
maintained in DMEM (Gibco) supplemented with 10% heat-inactivated FCS, 15mM L-glutamine and antibiotics. 
[0322] For the adherence assay, sub-confluent culture of Chang epithelial cells were rinsed with PBS and treated with 



94 



EP 1 790 660 A2 



trypsin-EDTA (Gibco), to release them from the plastic support. The cells were then suspended in PBS, counted and 
dilute in PBS to 5x1 0 5 cells/ml. 

[0323] Bacteria from over-night cultures or after induction with IPTG, were pelleted and washed twice with PBS by 
centrifuging at 13000 for 5 min. Approximately 2-3x1 0 8 (cfu) were incubated with 0.5 mg/ml FITC (Sigma) in 1ml buffer 
containing 50mM NaHC0 3 and 100mM NaCI pH 8, for 30 min at room temperature in the dark. FITC-labeled bacteria 
were wash 2-3 times and suspended in PBS at 1-1 .5x10 9 /ml. 200jxl of this suspension (2-3x1 0 8 ) were incubated with 
200(xl (1 x1 0 5 ) epithelial cells for 30min a 37°C. Cells were than centrifuged at 2000rpm for 5 min to remove non-adherent 
bacteria, suspended in 200^x1 of PBS, transferred to FACScan tubes and read 

Annex to the application documents - subsequently filed sequences listing 

[0324] 
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SEQUENCE LISTING 


<110> 


Chiron SRL 


<120> 


Heterologous Expression 


<±D\J> 




<±H\J> 

<141> 


UDU / \Jf -LO . J 

2001-02-28 


<150> 
<151> 


0004695.3 
2000-02-28 


<150> 

<1j1> 


0027675.8 
-)nnn 1 1 i 3 

L.\j\j\J— 


<160> 


620 


<170> 


SeqWin99, version 1.02 


<210> 
<211> 
<212> 
<213> 


1 

441 
PRT 

Neisseria meningitidis 


<400> 
Met Lys 


1 

Lys Tyr Leu Phe Arg Ala 



of Neisseria! Proteins 



lie Leu Ala Ala Cys Gin Ser Lys 
20 

Asp Thr Ser val lie Asn Gly Pro 
35 40 

Pro Ala Gly Thr Thr Val Gly Gly 
50 55 

Pro His Leu Ser Leu Pro His Trp 
65 70 

Leu Gin ser Phe Arg Leu Gly Cys 
85 

Trp Gin Asp Val Cys Ala Gin Ala 
100 

Gin Ala Lys Gin Phe Phe Gl u Arg 
115 120 

Gly Asn Gly Ser Leu Ala Gly Thr 
130 135 

Leu Lys Gly Asp Asp Arg Arg Thr 
145 150 

Gly lie Pro Asp Asp Phe lie ser 
165 

Ser Gly Lys Ala Leu Val Arg lie 
180 

Thr lie Asp Asn Thr Gly Gly Thr 

195 200 

Pro lie Thr Ala Arg Thr Thr Ala 

210 215 



Ala Leu 
10 

Ser lie 
25 

Asp Arg 

Gly Gly 

Ala Ala 

Ala Asn 
90 

Phe Gin 

105 

Tyr Phe 

Val Thr 

Ala Gin 

val pro 
170 

Arg Gin 
185 

His Thr 
He Lys 



Tyr Gly lie Ala 
Gin Thr 
pro val 



Phe Pro 
30 



Ala Val 
60 

Gin Asp 
75 

Leu Lys 
Thr Pro 
Thr Pro 



Gly lie 
45 

Tyr Thr 

Phe Ala 

Asn Arg 

Val His 
110 

Trp Gin 
125 



Gly Tyr 
140 

Ala Arg 
15 5 



Tyr Glu 
Phe Pro 
Leu pro Ala Gly 



Ala Ala 
15 

Gin Pro 

Pro Asp 

Val Val 

Lys Ser 
80 

Gin Gly 
95 

Ser Phe 
val Ala 
Pro Val 



lie Tyr 
160 

Leu Arg 

175 



Thr Gly 
Ala Asp 



Gly Arg 
220 



Ser Gly 
Arg Phe 
Phe Glu Gly Ser 



Lys Asn 
190 

Leu Ser 
205 
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Arg Phe Leu Pro Tyr His Thr Arg Asn Gin lie Asn Gly Gly Ala Leu 

225 230 235 240 

Asp Gly Lys Ala Pro He Leu Gly Tyr Ala Glu Asp Pro Val Glu Leu 
245 250 255 

Phe Phe Met His lie Gin Gly ser Gly Arg Leu Lys Thr pro Ser Gly 
260 265 270 

Lys Tyr lie Arg lie Gly Tyr Ala Asp Lys Asn Glu His Pro Tyr Val 
275 280 285 

Ser lie Gly Arg Tyr Met Ala Asp Lys Gly Tyr Leu Lys Leu Gly Gin 
290 295 300 

Thr Ser Met Gin Gly He Lys Ala Tyr Met Arg Gin Asn Pro Gin An 
305 310 315 32 

Leu Ala Glu Val Leu Gly Gin Asn Pro Ser Tyr lie Phe Phe Arg Glu 
325 330 335 

Leu Ala Gly Ser Ser Asn Asp Gly Pro val Gly Ala Leu Gly Thr Pro 
340 345 350 

Leu Met Gly Glu Tyr Ala Gly Ala val Asp Arg His Tyr lie Thr Leu 
355 360 365 

Gly Ala Pro Leu Phe val Ala Thr Ala His Pro val Thr Arg Lys Ala 
370 375 380 

Leu Asn Arg Leu lie Met Ala Gin Asp Thr Gly Ser Ala lie Lys Gly 
385 390 395 400 

Ala Val Arg Val Asp Tyr Phe Trp Gly Tyr Gly Asp Glu Ala Gly Glu 
405 410 415 

Leu Ala Gly Lys Gin Lys Thr Thr Gly Tyr val Trp Gin Leu Leu Pro 
420 425 430 

Asn Gly Met Lys Pro Glu Tyr Arg Pro 
435 440 

<210> 2 
<211> 420 
<212> PRT 

<213> Neisseria meningitidis 
<400> 2 

Gin Ser Lys Ser lie Gin Thr Phe Pro Gin Pro Asp Thr Ser Val lie 
15 10 15 

Asn Gly Pro Asp Arg Pro Val Gly lie Pro Asp Pro Ala Gly Thr Thr 
20 25 30 

val Gly Gly Gly Gly Ala val Tyr Thr val val Pro His Leu ser Leu 
35 40 45 

Pro His Trp Ala Ala Gin Asp Phe Ala Lys Ser Leu Gin Ser Phe Arg 
50 55 60 

Leu Gly Cys Ala Asn Leu Lys Asn Arg Gin Gly Trp Gin Asp val Cys 
65 70 75 80 

Ala Gin Ala Phe Gin Thr Pro Val His ser Phe Gin Ala Lys Gin Phe 
85 90 95 

Phe Glu Arg Tyr Phe Thr Pro Trp Gin val Ala Gly Asn Gly Ser Leu 
100 105 110 
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Ala Gly Thr Val Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp 
115 120 125 

Arg Arg Thr Ala Gin Ala Arg Phe Pro lie Tyr Gly lie Pro Asp Asp 
130 135 140 

Phe lie Ser val Pro Leu Pro Ala Gly Leu Arg ser Gly Lys Ala Leu 
145 150 155 160 

Val Arg lie Arg Gin Thr Gly Lys Asn Ser Gly Thr lie Asp Asn Thr 
165 170 175 

Gly Gly Thr His Thr Ala Asp Leu Ser Arg Phe Pro lie Thr Ala Arg 
180 185 190 

Thr Thr Ala lie Lys Gly Arg Phe Glu Gly Ser Arg Phe Leu Pro Tyr 
195 200 205 

His Thr Arg Asn Gin lie Asn Gly Gly Ala Leu Asp Gly Lys Ala Pro 
210 215 220 

lie Leu Gly Tyr Ala Glu Asp Pro Val Glu Leu Phe Phe Met His lie 
225 230 235 240 

Gin Gly Ser Gly Arg Leu Lys Thr Pro Ser Gly Lys Tyr lie Arg lie 
245 250 255 

Gly Tyr Ala Asp Lys Asn Glu His Pro Tyr val ser lie Gly Arg Tyr 
260 265 270 

Met Ala Asp Lys Gly Tyr Leu Lys Leu Gly Gin Thr Ser Met Gin Gly 
275 280 285 

lie Lys Ala Tyr Met Arg Gin Asn Pro Gin Arg Leu Ala Glu Val Leu 
290 295 300 

Gly Gin Asn Pro Ser Tyr lie Phe Phe Arg Glu Leu Ala Gly Ser Ser 
305 310 315 320 

Asn Asp Gly pro val Gly Ala Leu Gly Thr pro Leu Met Gly Glu Tyr 
325 330 335 

Ala Gly Ala Val Asp Arg His Tyr lie Thr Leu Gly Ala Pro Leu Phe 
340 345 350 

Val Ala Thr Ala His Pro Val Thr Arg Lys Ala Leu Asn Arg Leu lie 
355 360 365 

Met Ala Gin Asp Thr Gly Ser Ala lie Lys Gly Ala Val Arg Val Asp 
370 375 380 

Tyr Phe Trp Gly Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gin 
385 390 395 400 

Lys Thr Thr Gly Tyr val Trp Gin Leu Leu Pro Asn Gly Met Lys Pro 
405 410 415 

Glu Tyr Arg Pro 
420 

<210> 3 
<211> 440 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 919 
<400> 3 
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Met Lys Thr 
Leu Ala Ala 



Thr Ser val 
35 

Ala Gly Thr 
50 

His Leu Ser 
65 

Gin ser phe 

Gin Asp Val 

Ala Lys Gin 
115 

Asn Gly ser 
130 

Lys Gly Asp 
145 

lie Pro Asp 

Gly Lys Ala 

lie Asp Asn 
195 

lie Thr Ala 
210 

Phe Leu Pro 

225 

Gly Lys Ala 

Phe Met His 

Tyr lie Arg 
275 

lie Gly Arg 
290 

ser Met Gin 
305 

Ala Glu val 
Ala Gly Ser 



Met Gly Glu 

355 



Phe Phe Lys 
5 

Cys Gin Ser 
20 

lie Asn Gly 

Thr val Gly 

Leu Pro His 
70 

Arg Leu Gly 
85 

cys Ala Gin 
100 

Phe Phe Glu 

Leu Ala Gly 

Asp Arg Arg 
150 

Asp Phe lie 
165 

Leu val Arg 
180 

Thr Gly Gly 

Arg Thr Thr 

Tyr His Thr 
230 

Pro lie Leu 
245 

lie Gin Gly 
260 

lie Gly Tyr 
Tyr Met Ala 



Gly lie Lys 
310 

Leu Gly Gin 

325 

Ser Asn Asp 
340 

Tyr Ala Gly 



Thr Leu 

Lys Ser 

Pro Asp 
40 

Gly Gly 
55 

Trp Ala 

cys Ala 

Ala Phe 

Arg Tyr 
120 

Thr Val 
135 

Thr Ala 

Ser Val 

lie Arg 

Thr His 
200 

Ala lie 
215 

Arg Asn 

Gly Tyr 

ser Gly 

Ala Asp 
280 

Asp Lys 
295 

Ser Tyr 
Asn Pro 
Gly Pro 



Ser Ala 
10 

lie Gin 

25 

Arg Pro 

Gly Ala 

Ala Gin 

Asn Leu 
90 

Gin Thr 
105 

Phe Thr 

Thr Gly 

Gin Ala 

Pro Leu 
170 

Gin Thr 
185 

Thr Ala 
Lys Gly 
Gin He 



Ala Glu 
250 

Arg Leu 

265 

Lys Asn 



Ala Ala 

Thr Phe 

val Gly 

val Tyr 
60 

Asp Phe 
75 

Lys Asn 

Pro val 

Pro Trp 

Tyr Tyr 
140 

Arg Phe 
155 

Pro Ala 

Gly Lys 

Asp. Leu 

Arg Phe 
220 

Asn Gly 
235 

Asp Pro 
Lys Thr 

Glu His 



Leu Ala Leu lie 
15 

Pro Gin Pro Asp 
30 

lie Pro Asp Pro 
45 

Thr val Val Pro 



Ala Lys Ser Leu 
80 

Arg Gin Gly Trp 
95 

His Ser Phe Gin 
110 

Gin val Ala Gly 
125 

Glu Pro val Leu 



Pro lie 

Gly Leu 

Asn Ser 
190 

Ser Arg 
205 

Glu Gly 
Gly Ala 
val Glu 



Tyr Gly 
160 

Arg ser 
175 

Gly Thr 
Phe Pro 
Ser Arg 



Leu Asp 
240 

Leu Phe 

255 



Ala val 
360 



Gly Tyr Leu Lys 
300 

Met Arg Gin Asn 

315 

Ser Tyr lie Phe 
330 

val Gly Ala Leu 
345 

Asp Arg His Tyr 



Gly Lys 
val Ser 
Leu Gly Gin Thr 



pro ser 
270 

Pro Tyr 

285 



Pro Gin Arg Leu 

320 

Phe Arg Glu Leu 

335 

Gly Thr Pro Leu 
350 

lie Thr Leu Gly 

365 
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Ala Pro Leu Phe val Ala Thr Ala His Pro val Thr Arg Lys Ala Leu 
370 375 380 

Asn Arg Leu lie Met Ala Gin Asp Thr Gly Ser Ala lie Lys Gly Ala 
385 ~ 390 395 400 

val Arg val Asp Tyr Phe Trp Gly Tyr Gly Asp Glu Ala Gly Glu Leu 
405 410 415 

Ala Gly Lys Gin Lys Thr Thr Gly Tyr Val Trp Gin Leu Leu Pro Asn 
420 425 430 

Gly Met Lys Pro Glu Tyr Arg Pro 
435 440 

<210> 4 

<211> 58 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 907-2. pep 

<400> 4 

Glu Arg Arg Arg Leu Leu Val Asn lie Gin Tyr Glu Ser Ser Arg Ala 
1 5 10 15 

Gly Leu Asp Thr Gin lie val Leu Gly Leu lie Glu val Glu ser Ala 
20 25 30 

Phe Arg Gin Tyr Ala lie Ser Gly Val Gly Ala Arg Gly Leu Met Gin 
35 40 45 

Val Met Pro Phe Trp Lys Asn Tyr lie Gly 
50 55 

<210> 5 

<211> 60 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 

<400> 5 

Glu Arg Phe Pro Leu Ala Tyr Asn Asp Leu Phe Lys Arg Tyr Thr Ser 
1 5 10 15 

Gly Lys Glu lie Pro Gin Ser Tyr Ala Met Ala lie Ala Arg Gin Glu 
20 25 30 

Ser Ala Trp Asn Pro Lys Val Lys Ser Pro Val Gly Ala Ser Gly Leu 
3 5 40 45 

Met Gin lie Met Pro Gly Thr Ala Thr His Thr val 
50 55 60 

<210> 6 

<211> 120 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 922. pep 

<400> 6 

val Ala Gin Lys Tyr Gly val pro Ala Glu Leu lie val Ala val lie 
1 5 10 15 
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Gly lie Glu Thr Asn Tyr Gly Lys Asn Thr Gly Ser Phe Arg val Ala 
20 25 30 

Asp Ala Leu Ala Thr Leu Gly Phe Asp Tyr Pro Arg Arg Ala Gly Phe 
35 40 45 

Phe Gin Lys Glu Leu val Glu Leu Leu Lys Leu Ala Lys Glu Glu Gly 
50 55 60 

Gly Asp Val Phe Ala Phe Lys Gly Ser Tyr Ala Gly Ala Met Gly Met 
65 70 75 80 

Pro Gin Phe Met Pro Ser Ser Tyr Arg Lys Trp Ala Val Asp Tyr Asp 
85 90 95 

Gly Asp Gly His Arg Asp lie Trp Gly Asn Val Gly Asp Val Ala Ala 
100 105 110 

Ser val Ala Asn Tyr Met Lys Gin 
115 120 

<210> 7 
<211> 119 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Escherichia coli 
<400> 7 

Ala Trp Gin val Tyr Gly val Pro Pro Glu lie lie val Gly lie lie 
15 10 15 

Gly val Glu Thr Arg Trp Gly Arg Val Met Gly Lys Thr Arg lie Leu 
20 25 30 

Asp Ala Leu Ala Thr Leu Ser Phe Asn Tyr Pro Arg Arg Ala Glu Tyr 
35 40 45 

Phe Ser Gly Glu Leu Glu Thr Phe Leu Leu Met Ala Arg Asp Glu Gin 
50 55 60 

Asp Asp Pro Leu Asn Leu Lys Gly Ser Phe Ala Gly Ala Met Gly Tyr 
65 70 75 80 

Gly Gin Phe Met Pro Ser Ser Tyr Lys Gin Tyr Ala Val Asp Phe Ser 
85 90 95 

Gly Asp Gly His lie Asn Leu Trp Asp Pro val Asp Ala He Gly Ser 
100 105 110 

val Ala Asn Tyr Phe Lys Ala 
115 

<210> 8 
<211> 194 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> 919. pep 
<400> 8 

Ala Leu Asp Gly Lys Ala Pro lie Leu Gly Tyr Ala Glu Asp Pro val 
15 10 15 

Glu Leu Phe Phe Met His lie Gin Gly Ser Gly Arg Leu Lys Thr Pro 
20 2 5 30 
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Ser Gly Lys Tyr lie Arg He Gly Tyr Ala Asp Lys Asn Glu His Pro 
35 40 45 

Tyr val Ser lie Gly Arg Tyr Met Ala Asp Lys Gly Tyr Leu Lys Leu 
50 55 60 

Gly Gin Thr Ser Met Gin Gly He Lys Ser Tyr Met Arg Gin Asn Pro 
65 70 75 " 80 

Gin Arg Leu Ala Glu val Leu Gly Gin Asn Pro Ser Tyr lie Phe Phe 
85 90 95 

Arg Glu Leu Ala Gly Ser Ser Asn Asp Gly Pro val Gly Ala Leu Gly 
100 105 110 

Thr Pro Leu Met Gly Glu Tyr Ala Gly Ala Val Asp Arg His Tyr lie 
115 120 125 

Thr Leu Gly Ala Pro Leu Phe Val Ala Thr Ala His Pro Val Thr Arg 
130 135 140 

Lys Ala Leu Asn Arg Leu lie Met Ala Gin Asp Thr Gly Ser Ala lie 
145 150 15 5 160 

Lys Gly Ala Val Arg Val Asp Tyr Phe Trp Gly Tyr Gly Asp Glu Ala 
165 170 175 

Gly Glu Leu Ala Gly Lys Gin Lys Thr Thr Gly Tyr Val Trp Gin Leu 
180 185 190 

Leu Pro 

<210> 9 

<211> 196 

<212> PRT 

<213> Escherichia coli 

<400> 9 

Ala Leu Ser Asp Lys Tyr lie Leu Ala Tyr Ser Asn Ser Leu Met Asp 
1 5 10 15 

Asn Phe lie Met Asp Val Gin Gly Ser Gly Tyr lie Asp Phe Gly Asp 
20 25 30 

Gly Ser Pro Leu Asn Phe Phe Ser Tyr Ala Gly Lys Asn Gly His Ala 
35 40 45 

Tyr Arg Ser lie Gly Lys val Leu lie Asp Arg Gly Glu val Lys Lys 
50 55 60 

Glu Asp Met Ser Met Gin Ala lie Arg His Trp Gly Glu Thr His Ser 
65 70 75 80 

Glu Ala Glu Val Arg Glu Leu Leu Glu Gin Asn Pro Ser Phe val Phe 
85 90 95 

Phe Lys Pro Gin Ser Phe Ala Pro Val Lys Gly Ala Ser Ala Val Pro 
100 105 110 

Leu Val Gly Arg Ala Ser val Ala Ser Asp Arg Ser lie lie Pro Pro 
115 120 12 5 

Gly Thr Thr Leu Leu Ala Glu Val Pro Leu Leu Asp Asn Asn Gly Lys 
130 135 140 

Phe Asn Gly Gin Tyr Glu Leu Arg Leu Met Val Ala Leu Asp val Gly 
145 150 15 5 160 
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10 



15 



20 



25 



30 



35 



40 



45 



Gly Ala lie Lys Gly Gin His Phe Asp lie Tyr Gin Gly lie Gly Pro 
165 170 175 

Glu Ala Gly His Arg Ala Gly Trp Tyr Asn His Tyr Gly Arg val Trp 
180 185 190 

Val Leu Lys Thr 
195 

<210> 10 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Ol i gonucl eoti de 

<400> 10 

cgaagacccc gtcggtcttt tttttatg 28 



<210> 


11 


<211> 


28 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Oligonucleotide 


<400> 


11 


gtgcataaaa aaaagaccga cggggtct 


<210> 


12 


<211> 


25 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Ol i gonucl eoti de 


<400> 


12 


aacgcctcgc cggtgttttg ggtca 


<210> 


13 


<211> 


25 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Ol igonucleotide 


<400> 


13 


tttgacccaa aacaccggcg aggcg 


<210> 


14 


<211> 


26 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 



28 



25 



25 



<220> 

<223> Ol igonucleotide 

50 

<400> 14 

tgccggcgca gtcggtcggc actaca 26 

<210> 15 

<211> 26 

55 <212> DNA 

<213> Artificial Sequence 
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<220> 

<223> ol igonucleotide 

<400> 15 

taatgtagtg ccgaccgact gcgccg 

<210> 16 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 16 

tgattgaggt gggtagcgcg ttccg 

<210> 17 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 17 

ggcggaacgc gctacccacc tcaat 

<210> 18 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 18 

ccggaattct tatgaaaaaa atcatcttcg ccgc 

<210> 19 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 19 

gcccaagctt ttattgtttg gctgcctcga tt 

<210> 20 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 20 

ccggaattct tatgtcgccc gatgttaaat cggcgga 

<210> 21 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 
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<400> 21 

gcccaagctt tcaatcctgc tcttttttgc eg 

<210> 22 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 22 

ceggaattet tatgagecaa gatatggegg cagt 

<210> 23 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 23 

gcccaagctt tcaatcctgc tcttttttgc eg 

<210> 24 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucl eotide 

<400> 24 

ceggaattet tatgtccgcc gaatccgcaa atca 

<210> 25 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 25 

gcccaagctt tcaatcctgc tcttttttgc eg 

<210> 26 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 26 

ceggaattet tatgggaagg gttgatttgg ctaatg 

<210> 27 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 27 

gcccaagctt tcaatcctgc tcttttttgc eg 
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<210> 28 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 28 

ccggaattct tatgtcagat ttggcaaacg attctt 

<210> 29 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 29 

gcccaagctt ttacgtatca tatttcacgt gcttc 

<210> 30 

<211> 37 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oli gonucl eotide 

<400> 30 

ccggaattct tatgtcgccc gatgttaaat cggcgga 

<210> 31 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucl eotide 

<400> 31 

gcccaagctt ttacgtatca tatttcacgt gcttc 

<210> 32 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 32 

ccggaattct tatgcaaagc aagagcatcc aaacct 

<210> 33 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucl eotide 

<400> 33 

gcccaagctt ttacgggcgg tattcgggct 

<210> 34 

<211> 29 

<212> DNA 



106 



EP 1 790 660 A2 



<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 
<400> 34 

ccggaattca tatgaaacac tttccatcc 



<210> 


35 


<211> 


28 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Ol igonucleotide 


<400> 


35 



gcccaagctt ttaccactcg taattgac 



<210> 


36 


<211> 


29 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Ol igonucleotide 


<400> 


36 



ccggaattca tatggccaca agcgacgac 

<210> 37 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 37 

gcccaagctt ttaccactcg taattgac 

<210> 38 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol igonucleotide 

<400> 38 

ccggaattct tatgaaacac tttccatcc 

<210> 39 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 39 

gcccaagctt tcaacccacg ttgtaaggtt g 

<210> 40 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> 


Oligonucleotide 






<400> 


40 




5 


ccggaattct tatggccaca aacgacgacg 


30 




<210> 


41 






<211> 


31 






<212> 


DNA 






<213> 


Arti f i ci al sequence 




10 


<220> 








<223> 


Ol igonucleotide 






<400> 


41 






gcccaagctt tcaacccacg ttgtaaggtt g 


31 


15 


<210> 


42 






<211> 


34 






<212> 


DNA 






<213> 


Arti f i ci al Sequence 






<220> 






20 


<223> 


Ol igonucleotide 






<400> 


42 






ccggaattct tatggccacc tacaaagtgg acga 


34 




<210> 


43 






<211> 


32 




25 


<212> 


DNA 






<213> 


Arti f i ci al Sequence 






<220> 








<223> 


Ol igonucleotide 




30 


<400> 


43 






gcccaagctt ttattgtttg gctgcctcga tt 


32 




<210> 


44 






<211> 


32 






<212> 


DNA 




35 


<213> 


Arti f i ci al Sequence 






<220> 








<223> 


ol i gonucl eoti de 






<400> 


44 




40 


cgcggatccg ctagccccga tgttaaatcg gc 


32 




<210> 


45 






<211> 


31 






<212> 


DNA 






<213> 


Arti f i ci al Sequence 




45 


<220> 








<223> 


Ol igonucleotide 






<400> 


45 






cccgctcgag tcaatcctgc tcttttttgc c 


31 


50 


<210> 


46 






<211> 


32 






<212> 


DNA 






<213> 


Arti f i ci al Sequence 






<220> 






55 


<223> 


Ol i gonucl eoti de 






<400> 


46 
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cgcggatccg ctagccaaga tatggcggca gt 

<210> 47 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 47 

cgcggatccg ctagcgccga atccgcaaat ca 

<210> 48 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 48 

cgcgctagcg gaagggttga tttggctaat gg 

<210> 49 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 49 

gggaattcca tatgggcatt tcccgcaaaa tatc 

<210> 50 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 50 

cccgctcgag ttacgtatca tatttcacgt gc 

<210> 51 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 51 

gggaattcca tatgggcatt tcccgcaaaa tatc 

<210> 52 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 52 

cccgctcgag ttattctatg ccttgtgcgg cat 

<210> 53 
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<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 53 

cgcggatccc atatggccac aagcgacgac ga 

<210> 54 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 54 

cccgctcgag ttaccactcg taattgac 

<210> 55 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol igonucl eotide 

<400> 55 

cgcggatccc atatggccac aaacgacg 

<210> 56 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 56 

cccgctcgag tcatttagca atattatctt tgttc 

<210> 57 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 57 

cgcggatccc atatgaaagc aaacagtgcc gac 

<210> 58 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 58 

cccgctcgag ttaccactcg taattgac 

<210> 59 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
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<220> 
<223> 


01 i gonucl eoti de 


5 


<400> 59 

cgcggatccc atatggccac aaacgacg 


10 


<210> 
<211> 
<212> 
<213> 


60 
29 
DNA 

Artifi cial Sequence 




<220> 
<223> 


Ol igonucleotide 


15 


<400> 60 

cccgctcgag ttaacccacg ttgtaaggt 


<210> 
<211> 
<212> 
<213> 


61 
33 
DNA 

Arti f i ci al Sequence 


20 


<220> 
<223> 


Ol igonucleotide 




<400> 61 

cgcggatccc atatgatgaa acactttcca 


25 


<210> 
<211> 
<212> 
<213> 


62 
29 

DNA 

Arti f i ci al Sequence 


30 


<220> 
<223> 


Ol igonucleotide 




<400> 62 

cccgctcgag ttaacccacg ttgtaaggt 


35 


<210> 
<211> 
<212> 
<213> 


63 
28 
DNA 

Arti f i ci al Sequence 




<220> 
<223> 


Oligonucleotide 


40 


<400> 63 

cgcggatccc atatggccac aaacgacg 


45 


<210> 
<211> 
<212> 
<213> 


64 
32 
DNA 

Artifi cial Sequence 




<220> 
<223> 


Ol igonucleotide 


50 


<400> 64 

cccgctcgag tcagtctgac actgttttat 




<210> 
<211> 
<212> 
<213> 


65 
32 
DNA 

Artifi cial sequence 


55 


<220> 
<223> 


Ol igonucleotide 



28 



29 



33 



29 



28 



32 
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10 



30 



50 



<400> 65 

cgcggatccg ctagccccga tgttaaatcg gc 32 

<210> 66 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucl eotide 



<400> 66 

cccgctcgag ttacgggcgg tattcgg 27 

<210> 67 

<211> 32 

15 <212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

20 <400> 67 

cgcggatccg ctagccccga tgttaaatcg gc 32 

<210> 68 

<211> 32 

<212> DNA 

25 <213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 68 

cccgctcgag ttacgtatca tatttcacgt gc 32 

<210> 69 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

35 <220> 

<223> Ol i gonucl eoti de 

<400> 69 

cgcggatccg ctagccccga tgttaaatcg gc 32 

40 <210> 70 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

45 <223> Oligonucleotide 

<400> 70 

cccgctcgag ttaccactcg taattgac 28 



55 



<210> 71 

<211> 1457 

<212> PRT 

<213> Neisseria meningitidis 

<400> 71 

Met Lys Thr Thr Asp Lys Arg Thr Thr Glu Thr His Arg Lys Ala Pro 
1 5 10 15 

Lys Thr Gly Arg lie Arg Phe ser Pro Ala Tyr Leu Ala lie Cys Leu 



112 



EP 1 790 660 A2 



20 25 30 

Ser Phe Gly lie Leu Pro Gin Ala Trp Ala Gly His Thr Tyr Phe Gly 
35 40 45 

lie Asn Tyr Gin Tyr Tyr Arg Asp Phe Ala Glu Asn Lys Gly Lys Phe 
50 55 60 

Ala val Gly Ala Lys Asp lie Glu val Tyr Asn Lys Lys Gly Glu Leu 
65 70 75 80 

Val Gly Lys Ser Met Thr Lys Ala Pro Met lie Asp Phe Ser Val Val 
85 90 95 

Ser Arg Asn Gly Val Ala Ala Leu Val Gly Asp Gin Tyr lie Val Ser 
100 105 110 

Val Ala His Asn Gly Gly Tyr Asn Asn val Asp Phe Gly Ala Glu Gly 
115 120 125 

Arg Asn Pro Asp Gin His Arg Phe Thr Tyr Lys lie val Lys Arg Asn 
130 135 140 

Asn Tyr Lys Ala Gly Thr Lys Gly His Pro Tyr Gly Gly Asp Tyr His 
145 150 155 160 

Met Pro Arg Leu His Lys Phe Val Thr Asp Ala Glu Pro Val Glu Met 
165 170 175 

Thr Ser Tyr Met Asp Gly Arg Lys Tyr lie Asp Gin Asn Asn Tyr Pro 
180 185 190 

Asp Arg val Arg lie Gly Ala Gly Arg Gin Tyr Trp Arg Ser Asp Glu 
195 200 205 

Asp Glu Pro Asn Asn Arg Glu Ser Ser Tyr His lie Ala Ser Ala Tyr 
210 215 220 

Ser Trp Leu val Gly Gly Asn Thr Phe Ala Gin Asn Gly Ser Gly Gly 
225 230 235 240 

Gly Thr Val Asn Leu Gly Ser Glu Lys lie Lys His Ser Pro Tyr Gly 
245 250 255 

Phe Leu Pro Thr Gly Gly ser Phe Gly Asp ser Gly ser Pro Met Phe 
260 265 270 

lie Tyr Asp Ala Gin Lys Gin Lys Trp Leu lie Asn Gly val Leu Gin 

275 280 285 

Thr Gly Asn Pro Tyr lie Gly Lys Ser Asn Gly Phe Gin Leu val Arg 
290 295 300 

Lys Asp Trp Phe Tyr Asp Glu lie Phe Ala Gly Asp Thr His Ser val 
305 310 315 320 

Phe Tyr Glu Pro Arg Gin Asn Gly Lys Tyr Ser Phe Asn Asp Asp Asn 
325 330 335 

Asn Gly Thr Gly Lys lie Asn Ala Lys His Glu His Asn Ser Leu Pro 
340 345 350 

Asn Arg Leu Lys Thr Arg Thr val Gin Leu Phe Asn val ser Leu Ser 
355 360 365 

Glu Thr Ala Arg Glu Pro val Tyr His Ala Ala Gly Gly Val Asn Ser 
370 375 380 

Tyr Arg pro Arg Leu Asn Asn Gly Glu Asn lie Ser Phe lie Asp Glu 
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385 

Gly Lys 

Gly Leu 

Thr Trp 

Trp Lys 
450 

Gly Thr 
465 

Val Gly 

Lys Lys 

val Gin 

Gly Phe 
530 

His Arg 
545 

Gin Asp 
Thr Thr 
Asn Gly 

Asn Leu 

610 

Gly Gly 
625 

Phe Phe 
His Trp 

Asn Asp 



Gly Gly 
690 

Trp His 
705 



Gly Glu Leu 
405 

Tyr Phe Gin 
420 

Gin Gly Ala 
435 

val Asn Gly 

Leu His Val 

Asp Gly Thr 
485 

Gin Ala Phe 
500 

Leu Asn Ala 
515 

Arg Gly Gly 

lie Gin Asn 

Lys Glu ser 
565 

Gly Asn Asn 
580 

Trp Phe Gly 
595 

val Tyr Gin 
Thr Asn Leu 



Ser Gly Arg 
645 

Ser Gin Lys 
660 

Trp lie Asn 
675 

Gin Ala val 



390 

lie Leu Thr Ser 



Gly Asp 

Gly val 

val Ala 
455 

Gin Ala 
470 

val lie 
Ser Glu 
Asp Asn 



Phe Thr 

425 

His He 
440 

Asn Asp 
Lys Gly 
Leu Asp 



lie Gly 
505 

Gin Phe 
520 



395 

Asn lie 
410 

Val Ser 
Ser Glu 
Arg Leu 



Asn Gin 
Pro Glu 



Asp Ser 
445 

Ser Lys 
460 



Glu Asn 
475 



Gin Gly 
Ala Asp 
Leu val Ser Gly 



Gin Gin 
490 



Asn Pro 



Asp Leu Asn Gly 
Glu Gly 
Thr val Thr lie 



Arg Leu 
535 

Thr Asp 

550 



Ala Met 
555 



Asn Ser Leu Asp 
585 

Glu Lys Asp Thr 
600 

pro Ala Ala Glu 
615 

Asn Gly Asn lie 
630 

Pro Thr Pro His 



Glu Gly 
Arg Thr 



Leu ser Asn 



val Ser 
695 

His Ala 
710 



His Thr lie 

725 



Gin Ser 
Cys Val 
Lys Thr Asp lie Ser 



lie Pro 

665 

Phe Lys 
680 

Arg Asn 
Gin Ala 

Arg Ser 



Glu Lys Thr 
740 



Cys Thr 
He Thr 
Gly Asn val Asp 



Asp Asp 
745 



Thr Gly 
570 

Ser Lys 
Thr Lys 
Asp Arg 

Thr Gin 

635 

Ala Tyr 
650 

Arg Gly 
Ala Glu 
Val Ala 

val Phe 

715 

Asp Trp 
730 

Lys Val 
Leu Ala 



Asp Lys 
525 

His Ser 
540 

lie Val 

Asn Lys 

Lys Glu 

Thr Asn 
605 

Thr Leu 
620 

Thr Asn 

Asn His 

Glu lie 

Asn Phe 
685 

Lys val 
700 

Gly val 
Thr Gly 
He Ala 
Asp His 



400 

Gly Ala Gly 
415 

Asn Asn Glu 
430 

Thr val Thr 

lie Gly Lys 

Ser lie Ser 
480 

Asp Lys Gly 
495 

Arg Gly Thr 
510 

Leu Tyr Phe 
Leu Ser Phe 



Asn His Asn 
560 

Asp lie Ala 
575 

lie Ala Tyr 
590 

Gly Arg Leu 



Leu Leu ser 

Gly Lys Leu 
640 

Leu Asn Asp 
655 

val Trp Asp 
670 

Gin lie Lys 
Lys Gly Asp 



Ala Pro His 
720 

Leu Thr Asn 

735 

Ser Leu Thr 
750 

Ala His Leu 
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755 760 765 

Asn Leu Thr Gly Leu Ala Thr Leu Asn Gly Asn Leu Ser Ala Asn Gly 
770 775 780 

Asp Thr Arg Tyr Thr val ser His Asn Ala Thr Gin Asn Gly Asn Leu 
785 790 795 800 

Ser Leu Val Gly Asn Ala Gin Ala Thr Phe Asn Gin Ala Thr Leu Asn 
805 810 815 

Gly Asn Thr Ser Ala Ser Gly Asn Ala Ser Phe Asn Leu Ser Asp His 
820 825 830 

Ala Val Gin Asn Gly Ser Leu Thr Leu Ser Gly Asn Ala Lys Ala Asn 
835 840 845 

Val Ser His Ser Ala Leu Asn Gly Asn val Ser Leu Ala Asp Lys Ala 
850 855 860 

Val Phe His Phe Glu Ser Ser Arg Phe Thr Gly Gin lie Ser Gly Gly 
865 870 875 880 

Lys Asp Thr Ala Leu His Leu Lys Asp Ser Glu Trp Thr Leu Pro Ser 
885 890 895 

Gly Thr Glu Leu Gly Asn Leu Asn Leu Asp Asn Ala Thr lie Thr Leu 
900 905 910 

Asn Ser Ala Tyr Arg His Asp Ala Ala Gly Ala Gin Thr Gly Ser Ala 
915 920 925 

Thr Asp Ala Pro Arg Arg Arg Ser Arg Arg Ser Arg Arg Ser Leu Leu 
930 935 ~ ' 940 

Ser Val Thr Pro Pro Thr Ser val Glu Ser Arg Phe Asn Thr Leu Thr 
945 950 955 960 

Val Asn Gly Lys Leu Asn Gly Gin Gly Thr Phe Arg Phe Met Ser Glu 
965 970 975 

Leu Phe Gly Tyr Arg Ser Asp Lys Leu Lys Leu Ala Glu Ser Ser Glu 
980 985 990 

Gly Thr Tyr Thr Leu Ala Val Asn Asn Thr Gly Asn Glu Pro Ala Ser 
995 1000 1005 

Leu Glu Gin Leu Thr val val Glu Gly Lys Asp Asn Lys Pro Leu ser 
1010 1015 1020 

Glu Asn Leu Asn Phe Thr Leu Gin Asn Glu His val Asp Ala Gly Ala 
1025 1030 1035 1040 

Trp Arg Tyr Gin Leu lie Arg Lys Asp Gly Glu Phe Arg Leu His Asn 
1045 1050 1055 

Pro val Lys Glu Gin Glu Leu Ser Asp Lys Leu Gly Lys Ala Glu Ala 
1060 1065 1070 

Lys Lys Gin Ala Glu Lys Asp Asn Ala Gin Ser Leu Asp Ala Leu lie 
1075 1080 1085 

Ala Ala Gly Arg Asp Ala val Glu Lys Thr Glu Ser val Ala Glu Pro 
1090 ^ 1095 1100 

Ala Arg Gin Ala Gly Gly Glu Asn Val Gly lie Met Gin Ala Glu Glu 
1105 1110 1115 1120 

Glu Lys Lys Arg val Gin Ala Asp Lys Asp Thr Ala Leu Ala Lys Gin 
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1125 1130 1135 

Arg Glu Ala Glu Thr Arg Pro Ala Thr Thr Ala Phe Pro Arg Ala Arg 
1140 1145 1150 

Arg Ala Arg Arg Asp Leu Pro Gin Leu Gin Pro Gin Pro Gin Pro Gin 
1155 1160 1165 

Pro Gin Arg Asp Leu lie Ser Arg Tyr Ala Asn Ser Gly Leu Ser Glu 
1170 1175 1180 

Phe Ser Ala Thr Leu Asn Ser Val phe Ala Val Gin Asp Glu Leu Asp 
1185 1190 1195 1200 

Arg Val Phe Ala Glu Asp Arg Arg Asn Ala val Trp Thr Ser Gly lie 
1205 ~ 1210 1215 

Arg Asp Thr Lys His Tyr Arg Ser Gin Asp Phe Arg Ala Tyr Arg Gin 
1220 1225 1230 

Gin Thr Asp Leu Arg Gin lie Gly Met Gin Lys Asn Leu Gly ser Gly 
1235 1240 1245 

Arg val Gly lie Leu Phe ser His Asn Arg Thr Glu Asn Thr Phe Asp 
1250 1255 1260 

Asp Gly lie Gly Asn Ser Ala Arg Leu Ala His Gly Ala Val Phe Gly 
1265 1270 ~ 1275 1280 

Gin Tyr Gly lie Asp Arg Phe Tyr lie Gly lie Ser Ala Gly Ala Gly 
1285 1290 1295 

Phe Ser Ser Gly Ser Leu Ser Asp Gly lie Gly Gly Lys lie Arg Arg 
1300 1305 1310 

Arg Val Leu His Tyr Gly lie Gin Ala Arg Tyr Arg Ala Gly Phe Gly 
1315 1320 1325 

Gly Phe Gly lie Glu Pro His He Gly Ala Thr Arg Tyr Phe Val Gin 
1330 1335 1340 

Lys Ala Asp Tyr Arg Tyr Glu Asn Val Asn lie Ala Thr Pro Gly Leu 
1345 1350 1355 1360 

Ala Phe Asn Arg Tyr Arg Ala Gly lie Lys Ala Asp Tyr Ser Phe Lys 
1365 1370 1375 

Pro Ala Gin His lie Ser lie Thr Pro Tyr Leu Ser Leu Ser Tyr Thr 
1380 1385 1390 

Asp Ala Ala Ser Gly Lys val Arg Thr Arg Val Asn Thr Ala val Leu 
1395 1400 1405 

Ala Gin Asp Phe Gly Lys Thr Arg ser Ala Glu Trp Gly val Asn Ala 
1410 1415 1420 

Glu lie Lys Gly Phe Thr Leu ser Leu His Ala Ala Ala Ala Lys Gly 
1425 1430 1435 1440 

Pro Gin Leu Glu Ala Gin His Ser Ala Gly lie Lys Leu Gly Tyr Arg 
1445 1450 1455 

Trp 

<210> 72 
<211> 21 
<212> PRT 

<213> Escherichia coli 
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<400> 72 

Met Lys Lys Thr Ala lie Ala lie Ala val Ala Leu Ala Gly Phe Ala 
15 10 15 

Thr Val Ala Gin Ala 

20 

<210> 73 
<211> 1439 
<212> PRT 

<213> Neisseria meningitidis 
<400> 73 

Met Lys Lys Thr Ala lie Ala lie Ala Val Ala Leu Ala Gly Phe Ala 
15 10 15 

Thr val Ala Gin Ala Ala ser Ala Gly His Thr Tyr Phe Gly lie Asn 
20 25 30 

Tyr Gin Tyr Tyr Arg Asp Phe Ala Glu Asn Lys Gly Lys Phe Ala val 
35 40 45 

Gly Ala Lys Asp lie Glu val Tyr Asn Lys Lys Gly Glu Leu Val Gly 
50 55 60 

Lys Ser Met Thr Lys Ala Pro Met lie Asp Phe Ser Val val Ser Arg 
65 70 75 80 

Asn Gly Val Ala Ala Leu Val Gly Asp Gin Tyr lie Val Ser Val Ala 
85 90 95 

His Asn Gly Gly Tyr Asn Asn val Asp Phe Gly Ala Glu Gly Arg Asn 
100 105 110 

Pro Asp Gin His Arg Phe Thr Tyr Lys lie val Lys Arg Asn Asn Tyr 
115 ~ 120 125 

Lys Ala Gly Thr Lys Gly His Pro Tyr Gly Gly Asp Tyr His Met Pro 
130 135 140 

Arg Leu His Lys Phe val Thr Asp Ala Glu Pro Val Glu Met Thr Ser 
145 150 155 160 

Tyr Met Asp Gly Arg Lys Tyr lie Asp Gin Asn Asn Tyr Pro Asp Arg 
165 170 175 

Val Arg lie Gly Ala Gly Arg Gin Tyr Trp Arg Ser Asp Glu Asp Glu 
180 185 190 

Pro Asn Asn Arg Glu Ser Ser Tyr His lie Ala Ser Ala Tyr Ser Trp 
195 200 205 

Leu val Gly Gly Asn Thr Phe Ala Gin Asn Gly Ser Gly Gly Gly Thr 
210 215 220 

Val Asn Leu Gly Ser Glu Lys lie Lys His Ser Pro Tyr Gly Phe Leu 
225 230 235 240 

Pro Thr Gly Gly Ser Phe Gly Asp Ser Gly Ser Pro Met Phe lie Tyr 
245 250 255 

Asp Ala Gin Lys Gin Lys Trp Leu lie Asn Gly Val Leu Gin Thr Gly 
260 265 270 

Asn Pro Tyr lie Gly Lys Ser Asn Gly Phe Gin Leu Val Arg Lys Asp 
275 280 285 

Trp Phe Tyr Asp Glu He Phe Ala Gly Asp Thr His Ser val Phe Tyr 
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290 295 300 

Glu Pro Arg Gin Asn Gly Lys Tyr Ser Phe Asn Asp Asp Asn Asn Gly 
305 310 315 320 

Thr Gly Lys lie Asn Ala Lys His Glu His Asn Ser Leu Pro Asn Arg 
325 330 335 

Leu Lys Thr Arg Thr val Gin Leu Phe Asn val ser Leu ser Glu Thr 
340 345 350 

Ala Arg Glu Pro Val Tyr His Ala Ala Gly Gly val Asn Ser Tyr Arg 
355 360 365 

Pro Arg Leu Asn Asn Gly Glu Asn lie Ser Phe lie Asp Glu Gly Lys 
370 375 380 

Gly Glu Leu lie Leu Thr ser Asn lie Asn Gin Gly Ala Gly Gly Leu 
385 390 395 400 

Tyr Phe Gin Gly Asp Phe Thr val Ser Pro Glu Asn Asn Glu Thr Trp 
405 410 415 

Gin Gly Ala Gly Val His lie Ser Glu Asp Ser Thr Val Thr Trp Lys 
420 425 430 

val Asn Gly Val Ala Asn Asp Arg Leu Ser Lys lie Gly Lys Gly Thr 
435 440 445 

Leu His val Gin Ala Lys Gly Glu Asn Gin Gly ser lie ser val Gly 
450 455 460 

Asp Gly Thr Val lie Leu Asp Gin Gin Ala Asp Asp Lys Gly Lys Lys 
465 470 475 480 

Gin Ala Phe Ser Glu lie Gly Leu Val Ser Gly Arg Gly Thr Val Gin 
485 490 495 

Leu Asn Ala Asp Asn Gin Phe Asn Pro Asp Lys Leu Tyr Phe Gly Phe 
500 505 510 

Arg Gly Gly Arg Leu Asp Leu Asn Gly His Ser Leu Ser Phe His Arg 
515 520 525 

lie Gin Asn Thr Asp Glu Gly Ala Met lie val Asn His Asn Gin Asp 
530 535 540 

Lys Glu Ser Thr val Thr lie Thr Gly Asn Lys Asp lie Ala Thr Thr 
545 550 555 560 

Gly Asn Asn Asn Ser Leu Asp Ser Lys Lys Glu lie Ala Tyr Asn Gly 
565 570 575 

Trp Phe Gly Glu Lys Asp Thr Thr Lys Thr Asn Gly Arg Leu Asn Leu 
580 585 590 

Val Tyr Gin Pro Ala Ala Glu Asp Arg Thr Leu Leu Leu Ser Gly Gly 
595 600 605 

Thr Asn Leu Asn Gly Asn lie Thr Gin Thr Asn Gly Lys Leu Phe Phe 
610 615 620 

Ser Gly Arg Pro Thr Pro His Ala Tyr Asn His Leu Asn Asp His Trp 
625 630 635 640 

Ser Gin Lys Glu Gly lie Pro Arg Gly Glu lie val Trp Asp Asn Asp 
645 650 65 5 

Trp He Asn Arg Thr Phe Lys Ala Glu Asn Phe Gin lie Lys Gly Gly 
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660 665 670 

Gin Ala Val Val Ser Arg Asn val Ala Lys val Lys Gly Asp Trp His 
675 680 685 

Leu Ser Asn His Ala Gin Ala val Phe Gly val Ala Pro His Gin ser 
690 695 700 

His Thr lie Cys Thr Arg Ser Asp Trp Thr Gly Leu Thr Asn Cys val 
705 710 715 720 

Glu Lys Thr lie Thr Asp Asp Lys Val lie Ala Ser Leu Thr Lys Thr 

725 730 735 

Asp lie ser Gly Asn val Asp Leu Ala Asp His Ala His Leu Asn Leu 
740 745 750 

Thr Gly Leu Ala Thr Leu Asn Gly Asn Leu Ser Ala Asn Gly Asp Thr 
755 760 765 

Arg Tyr Thr Val Ser His Asn Ala Thr Gin Asn Gly Asn Leu Ser Leu 
770 775 780 

val Gly Asn Ala Gin Ala Thr Phe Asn Gin Ala Thr Leu Asn Gly Asn 
785 790 795 800 

Thr Ser Ala Ser Gly Asn Ala Ser Phe Asn Leu Ser Asp His Ala val 
805 810 815 

Gin Asn Gly ser Leu Thr Leu ser Gly Asn Ala Lys Ala Asn val Ser 
820 825 830 

His Ser Ala Leu Asn Gly Asn val Ser Leu Ala Asp Lys Ala val Phe 
835 840 845 

His Phe Glu Ser Ser Arg Phe Thr Gly Gin lie Ser Gly Gly Lys Asp 
850 855 860 

Thr Ala Leu His Leu Lys Asp Ser Glu Trp Thr Leu Pro Ser Gly Thr 
865 870 875 880 

Glu Leu Gly Asn Leu Asn Leu Asp Asn Ala Thr lie Thr Leu Asn Ser 
885 890 895 

Ala Tyr Arg His Asp Ala Ala Gly Ala Gin Thr Gly Ser Ala Thr Asp 
900 905 910 

Ala Pro Arg Arg Arg Ser Arg Arg Ser Arg Arg Ser Leu Leu Ser Val 
915 ~ 920 ~ 92 5 

Thr Pro Pro Thr ser val Glu ser Arg Phe Asn Thr Leu Thr val Asn 
930 935 940 

Gly Lys Leu Asn Gly Gin Gly Thr Phe Arg Phe Met Ser Glu Leu Phe 
945 950 955 960 

Gly Tyr Arg ser Asp Lys Leu Lys Leu Ala Glu ser ser Glu Gly Thr 
965 970 975 

Tyr Thr Leu Ala Val Asn Asn Thr Gly Asn Glu Pro Ala Ser Leu Glu 
980 985 990 

Gin Leu Thr Val Val Glu Gly Lys Asp Asn Lys Pro Leu Ser Glu Asn 
995 1000 1005 

Leu Asn Phe Thr Leu Gin Asn Glu His val Asp Ala Gly Ala Trp Arg 
1010 1015 1020 

Tyr Gin Leu lie Arg Lys Asp Gly Glu Phe Arg Leu His Asn Pro val 
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1025 1030 1035 1040 

Lys Glu Gin Glu Leu Ser Asp Lys Leu Gly Lys Ala Glu Ala Lys Lys 
1045 1050 1055 

Gin Ala Glu Lys Asp Asn Ala Gin Ser Leu Asp Ala Leu lie Ala Ala 
1060 1065 1070 

Gly Arg Asp Ala val Glu Lys Thr Glu Ser Val Ala Glu Pro Ala Arg 
1075 1080 1085 

Gin Ala Gly Gly Glu Asn val Gly lie Met Gin Ala Glu Glu Glu Lys 
1090 1095 1100 

Lys Arg val Gin Ala Asp Lys Asp Thr Ala Leu Ala Lys Gin Arg Glu 
1105 1110 1115 1120 

Ala Glu Thr Arg Pro Ala Thr Thr Ala Phe Pro Arg Ala Arg Arg Ala 
1125 1130 ~ 1135 

Arg Arg Asp Leu Pro Gin Leu Gin Pro Gin Pro Gin Pro Gin Pro Gin 
1140 1145 1150 

Arg Asp Leu lie ser Arg Tyr Ala Asn ser Gly Leu ser Glu Phe ser 
1155 1160 1165 

Ala Thr Leu Asn Ser Val Phe Ala Val Gin Asp Glu Leu Asp Arg Val 
1170 1175 1180 

Phe Ala Glu Asp Arg Arg Asn Ala val Trp Thr Ser Gly lie Arg Asp 
1185 1190 1195 1200 

Thr Lys His Tyr Arg Ser Gin Asp Phe Arg Ala Tyr Arg Gin Gin Thr 
1205 1210 1215 

Asp Leu Arg Gin lie Gly Met Gin Lys Asn Leu Gly ser Gly Arg val 
1220 1225 1230 

Gly lie Leu Phe Ser His Asn Arg Thr Glu Asn Thr Phe Asp Asp Gly 
1235 1240 1245 

lie Gly Asn Ser Ala Arg Leu Ala His Gly Ala val Phe Gly Gin Tyr 
1250 1255 1260 

Gly lie Asp Arg Phe Tyr lie Gly lie Ser Ala Gly Ala Gly Phe Ser 
1265 1270 1275 1280 

Ser Gly Ser Leu Ser Asp Gly lie Gly Gly Lys lie Arg Arg Arg Val 
1285 1290 1295 

Leu His Tyr Gly lie Gin Ala Arg Tyr Arg Ala Gly Phe Gly Gly Phe 
1300 1305 1310 

Gly lie Glu Pro His lie Gly Ala Thr Arg Tyr Phe val Gin Lys Ala 
1315 1320 1325 

Asp Tyr Arg Tyr Glu Asn val Asn He Ala Thr Pro Gly Leu Ala Phe 
1330 1335 1340 

Asn Arg Tyr Arg Ala Gly lie Lys Ala Asp Tyr Ser Phe Lys Pro Ala 
1345 1350 1355 1360 

Gin His lie Ser lie Thr Pro Tyr Leu Ser Leu Ser Tyr Thr Asp Ala 
1365 1370 1375 

Ala ser Gly Lys val Arg Thr Arg val Asn Thr Ala val Leu Ala Gin 
1380 1385 1390 

Asp Phe Gly Lys Thr Arg ser Ala Glu Trp Gly val Asn Ala Glu lie 
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1395 1400 1405 

Lys Gly Phe Thr Leu Ser Leu His Ala Ala Ala Ala Lys Gly Pro Gin 
1410 1415 1420 

Leu Glu Ala Gin His Ser Ala Gly lie Lys Leu Gly Tyr Arg Trp 
1425 1430 1435 

<210> 74 
<211> 164 
<212> PRT 

<213> Neisseria meningitidis 
<400> 74 

Met Lys Lys Asn lie Leu Glu Phe Trp val Gly Leu Phe val Leu lie 
15 10 15 

Gly Ala Ala Ala val Ala Phe Leu Ala Phe Arg val Ala Gly Gly Ala 
20 25 30 

Ala Phe Gly Gly Ser Asp Lys Thr Tyr Ala val Tyr Ala Asp Phe Gly 
35 40 45 

Asp lie Gly Gly Leu Lys val Asn Ala Pro val Lys Ser Ala Gly val 
50 55 60 

Leu val Gly Arg val Gly Ala He Gly Leu Asp Pro Lys Ser Tyr Gin 
65 70 75 80 

Ala Arg val Arg Leu Asp Leu Asp Gly Lys Tyr Gin Phe Ser Ser Asp 
85 90 95 

val ser Ala Gin lie Leu Thr ser Gly Leu Leu Gly Glu Gin Tyr lie 
100 105 110 

Gly Leu Gin Gin Gly Gly Asp Thr Glu Asn Leu Ala Ala Gly Asp Thr 

115 120 125 

lie Ser val Thr Ser Ser Ala Met val Leu Glu Asn Leu lie Gly Lys 
130 135 140 

Phe Met Thr Ser Phe Ala Glu Lys Asn Ala Asp Gly Gly Asn Ala Glu 
145 150 155 160 

Lys Ala Ala Glu 

<210> 75 
<211> 21 
<212> PRT 

<213> Erwinia carotovora 
<400> 75 

Met Lys Tyr Leu Leu Pro Thr Ala Ala Ala Gly Leu Leu Leu Ala Ala 
15 10 15 

Gin Pro Ala Met Ala 
20 

<210> 76 
<211> 608 

<212> PRT 

<213> Neisseria meningitidis 0RF46 
<400> 76 

Leu Gly lie Ser Arg Lys lie Ser Leu lie Leu Ser lie Leu Ala val 
15 10 15 

cys Leu Pro Met His Ala His Ala Ser Asp Leu Ala Asn Asp Ser Phe 
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20 25 30 

lie Arg Gin Val Leu Asp Arg Gin His Phe Glu Pro Asp Gly Lys Tyr 
35 40 45 

His Leu Phe Gly Ser Arg Gly Glu Leu Ala Glu Arg Ser Gly His lie 
50 55 60 

Gly Leu Gly Lys lie Gin ser His Gin Leu Gly Asn Leu Met lie Gin 
65 70 75 80 

Gin Ala Ala lie Lys Gly Asn lie Gly Tyr lie val Arg Phe Ser Asp 
85 90 95 

His Gly His Glu Val His Ser Pro Phe Asp Asn His Ala Ser His Ser 
100 105 110 

Asp Ser Asp Glu Ala Gly Ser Pro Val Asp Gly Phe Ser Leu Tyr Arg 
115 120 125 

lie His Trp Asp Gly Tyr Glu His His Pro Ala Asp Gly Tyr Asp Gly 
130 135 140 

Pro Gin Gly Gly Gly Tyr Pro Ala Pro Lys Gly Ala Arg Asp lie Tyr 
145 150 155 ~ 160 

Ser Tyr Asp lie Lys Gly val Ala Gin Asn lie Arg Leu Asn Leu Thr 
165 170 175 

Asp Asn Arg ser Thr Gly Gin Arg Leu Ala Asp Arg Phe His Asn Ala 
180 185 190 

Gly Ser Met Leu Thr Gin Gly Val Gly Asp Gly Phe Lys Arg Ala Thr 
195 200 205 

Arg Tyr Ser Pro Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe 
210 215 " 220 

Asn Gly Thr Ala Asp lie Val Lys Asn lie He Gly Ala Ala Gly Glu 
225 230 235 240 

lie val Gly Ala Gly Asp Ala val Gin Gly He Ser Glu Gly Ser Asn 
245 250 255 

lie Ala val Met His Gly Leu Gly Leu Leu ser Thr Glu Asn Lys Met 
260 265 270 

Ala Arg lie Asn Asp Leu Ala Asp Met Ala Gin Leu Lys Asp Tyr Ala 
275 280 285 

Ala Ala Ala lie Arg Asp Trp Ala val Gin Asn Pro Asn Ala Ala Gin 
290 ~ 295 300 

Gly lie Glu Ala val Ser Asn lie Phe Met Ala Ala lie Pro He Lys 
305 310 315 320 

Gly lie Gly Ala val Arg Gly Lys Tyr Gly Leu Gly Gly lie Thr Ala 
325 330 335 

His Pro lie Lys Arg ser Gin Met Gly Ala lie Ala Leu Pro Lys Gly 
340 345 350 

Lys Ser Ala val Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr 
35 5 360 365 

Pro Ser Pro Tyr His Ser Arg Asn lie Arg Ser Asn Leu Glu Gin Arg 
370 375 ~ 380 

Tyr Gly Lys Glu Asn lie Thr Ser Ser Thr Val Pro Pro Ser Asn Gly 
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385 



390 



395 



400 



Lys Asn Val Lys Leu Ala Asp Gin Arg His Pro Lys Thr Gly Val Pro 
405 410 415 

Phe Asp Gly Lys Gly Phe Pro Asn Phe Glu Lys His val Lys Tyr Asp 
420 425 430 

Thr Lys Leu Asp lie Gin Glu Leu Ser Gly Gly Gly lie Pro Lys Ala 
435 440 445 

Lys Pro val ser Asp Ala Lys Pro Arg Trp Glu val Asp Arg Lys Leu 
450 455 460 

Asn Lys Leu Thr Thr Arg Glu Gin Val Glu Lys Asn Val Gin Glu lie 
465 470 475 480 

Arg Asn Gly Asn Lys Asn ser Asn Phe ser Gin His Ala Gin Leu Glu 
485 490 495 

Arg Glu lie Asn Lys Leu Lys Ser Ala Asp Glu lie Asn Phe Ala Asp 
500 505 510 

Gly Met Gly Lys Phe Thr Asp Ser Met Asn Asp Lys Ala Phe Ser Arg 
515 520 525 

Leu Val Lys Ser val Lys Glu Asn Gly Phe Thr Asn Pro Val Val Glu 
530 535 540 

Tyr Val Glu lie Asn Gly Lys Ala Tyr lie val Arg Gly Asn Asn Arg 
545 550 555 560 

val Phe Ala Ala Glu Tyr Leu Gly Arg lie His Glu Leu Lys Phe Lys 
565 ~ 570 575 

Lys Val Asp Phe Pro Val Pro Asn Thr Ser Trp Lys Asn Pro Thr Asp 
580 585 590 

val Leu Asn Glu Ser Gly Asn Val Lys Arg Pro Arg Tyr Arg ser Lys 
595 600 ~* 605 



<210> 77 

<211> 584 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 0RF46-2 

<400> 77 

ser Asp Leu Ala Asn Asp ser Phe lie Arg Gin val Leu Asp Arg Gin 
1 5 10 15 

His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly Glu 
20 25 30 

Leu Ala Glu Arg Ser Gly His lie Gly Leu Gly Lys lie Gin ser His 
35 40 45 

Gin Leu Gly Asn Leu Met lie Gin Gin Ala Ala lie Lys Gly Asn lie 
50 55 60 

Gly Tyr lie Val Arg Phe ser Asp His Gly His Glu Val His Ser Pro 
65 70 75 80 

Phe Asp Asn His Ala ser His Ser Asp Ser Asp Glu Ala Gly Ser Pro 



90 



95 
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val Asp Gly Phe Ser Leu Tyr Arg lie His Trp Asp Gly Tyr Glu His 
100 105 110 

His Pro Ala Asp Gly Tyr Asp Gly Pro Gin Gly Gly Gly Tyr Pro Ala 
115 120 125 

Pro Lys Gly Ala Arg Asp lie Tyr Ser Tyr Asp lie Lys Gly Val Ala 
130 135 140 

Gin Asn lie Arg Leu Asn Leu Thr Asp Asn Arg ser Thr Gly Gin Ar 
145 150 155 16' 

Leu Ala Asp Arg Phe His Asn Ala Gly ser Met Leu Thr Gin Gly val 
165 170 175 

Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr ser Pro Glu Leu Asp Arg 
180 185 190 

Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp lie val Lys 
195 200 205 

Asn lie lie Gly Ala Ala Gly Glu lie val Gly Ala Gly Asp Ala Val 
210 215 220 

Gin Gly lie Ser Glu Gly Ser Asn lie Ala val Met His Gly Leu Gly 
225 230 235 240 

Leu Leu Ser Thr Glu Asn Lys Met Ala Arg lie Asn Asp Leu Ala Asp 
245 250 255 

Met Ala Gin Leu Lys Asp Tyr Ala Ala Ala Ala lie Arg Asp Trp Ala 
260 265 270 

Val Gin Asn Pro Asn Ala Ala Gin Gly lie Glu Ala Val Ser Asn lie 
275 280 285 

Phe Met Ala Ala lie Pro lie Lys Gly lie Gly Ala Val Arg Gly Lys 
290 295 300 

Tyr Gly Leu Gly Gly lie Thr Ala His Pro lie Lys Arg Ser Gin Met 
305 310 315 320 

Gly Ala lie Ala Leu Pro Lys Gly Lys ser Ala val ser Asp Asn Phe 
325 330 335 

Ala Asp Ala Ala Tyr Ala Lys Tyr pro ser Pro Tyr His ser Arg Asn 
340 345 350 

lie Arg ser Asn Leu Glu Gin Arg Tyr Gly Lys Glu Asn lie Thr ser 
355 360 365 

Ser Thr val Pro Pro Ser Asn Gly Lys Asn val Lys Leu Ala Asp Gin 
370 375 380 

Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro Asn 
385 390 395 400 

Phe Glu Lys His Val Lys Tyr Asp Thr Lys Leu Asp lie Gin Glu Leu 
405 410 415 

Ser Gly Gly Gly lie Pro Lys Ala Lys Pro Val Ser Asp Ala Lys Pro 
420 425 430 

Arg Trp Glu val Asp Arg Lys Leu Asn Lys Leu Thr Thr Arg Glu Gin 
435 440 445 

Val Glu Lys Asn Val Gin Glu lie Arg Asn Gly Asn Lys Asn Ser Asn 
450 455 460 
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Phe Ser 
465 


Gin 


His 


Ala 


Gin 
470 


Leu 


Glu 


Arg 


Glu 


He 
475 


Asn 


Lys 


Leu 


Lys 


ser 
480 


Ala Asp 


Glu 


lie 


Asn 
485 


Phe Ala Asp 


Gly 


Met 
490 


Gly 


Lys 


Phe 


Thr 


ASP 
495 


ser 


Met Asn 


Asp 


Lys 
500 


Ala 


Phe 


Ser 


Arg 


Leu 

505 


val 


Lys 


ser 


val 


Lys 
510 


Glu 


Asn 


Gly Phe 


Thr 
515 


Asn 


Pro 


val 


val 


Glu 

520 


Tyr 


val 


Glu 


He 


Asn 
525 


Gly 


Lys 


Ala 


Tyr lie 
530 


val 


Arg Gly Asn Asn Arg 
535 


val 


Phe 


Ala 


Ala 
540 


Glu 


Tyr 


Leu Gly 


Arq lie 

545 


His 


Glu 


Leu 


Lys 
550 


Phe 


Lys 


Lys 


val 


ASP 

555 


Phe 


Pro 


val 


pro 


Asn 
560 


Thr Ser 


Trp 


Lys 


Asn 
565 


Pro 


Thr 


Asp 


Val 


Leu 
570 


Asn 


Glu 


Ser 


Gly 


Asn 
575 


val 


Lys Arg 


Pro 


Arg 
580 


Tyr 


Arg 


Ser 


Lys 


















<210> 
<211> 
<212> 
<213> 


78 
364 

PRT 

Neisseria meningitidis 


















<400> 
Met Ser 
1 


78 

Mot 


1 \/c 

Lys 


U-i e 

5 


phn 
rile 


rl U 


M 1 d 


Lys 


val 
10 


Leu 


Thr 


Thr 


Ala 


He 
15 


Leu 


Ala Thr 


Phe 


cys 
20 


Ser 


Gly 


Ala 


Leu 


Ala 

25 


Ala 


Thr 


Ser 


Asp 


Asp 
30 


Asp 


val 


Lys Lys 


Ala 
35 


Ala 


Thr 


val 


Ala 


He 
40 


Val 


Ala 


Ala 


Tyr 


Asn 
45 


Asn 


Gly Gin 


Glu lie 
50 


Asn 


Gly 


Phe 


Lys 


Ala 
55 


Gly 


Glu 


Thr 


He 


Tyr 
60 


ASp 


He 


Gly Glu 


Asp Gly 
65 


Thr 


He 


Thr 


Gin 

70 


Lys 


Asp 


Ala 


Thr 


Ala 

75 


Ala 


Asp 


val 


Glu 


Ala 
80 


ASP Asp 


Phe 


Lys 


Gly 
85 


Leu 


Gly 


Leu 


Lys 


Lys 
90 


val 


Val 


Thr 


Asn 


Leu 
95 


Thr 


Lys Thr 


val 


Asn 
100 


Glu 


Asn 


Lys 


Gin 


Asn 
105 


val 


Asp 


Ala 


Lys 


val 
110 


Lys 


Ala 


Ala Glu 


ser 
115 


Glu 


He 


Glu 


Lys 


Leu 
120 


Thr 


Thr 


Lys 


Leu 


Ala 
125 


ASp 


Thr 


Asp 


Ala Ala 
130 


Leu 


Ala 


Asp 


Thr 


Asp 

135 


Ala 


Ala 


Leu 


ASp 


Glu 
140 


Thr 


Thr 


Asn 


Ala 


Leu Asn 
145 


Lys 


Leu 


Gly 


Glu 
150 


Asn 


He 


Thr 


Thr 


Phe 

155 


Ala 


Glu 


Glu 


Thr 


Lys 
160 


Thr Asn 


He 


Val 


Lys 
165 


lie 


Asp 


Glu 


Lys 


Leu 
170 


Glu 


Ala 


Val 


Ala 


Asp 
175 


Thr 


val Asp 


Lys 


His 
180 


Ala 


Glu 


Ala 


Phe 


Asn 
185 


Asp 


He 


Ala 


Asp 


Ser 
190 


Leu 


Asp 


Glu Thr 


Asn 
195 


Thr 


Lys 


Ala 


ASp 


Glu 
200 


Ala 


val 


Lys 


Thr 


Ala 
205 


Asn 


Glu 


Ala 
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Lys Gin Thr Ala Glu Glu Thr Lys Gin Asn val Asp Ala Lys Val Lys 
210 215 220 

Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala 

225 230 235 240 

Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val Thr Asp 
245 250 255 

lie Lys Ala Asp lie Ala Thr Asn Lys Ala Asp lie Ala Lys Asn Ser 
260 265 270 

Ala Arg lie Asp Ser Leu Asp Lys Asn Val Ala Asn Leu Arg Lys Glu 
275 280 285 

Thr Arg Gin Gly Leu Ala Glu Gin Ala Ala Leu Ser Gly Leu Phe Gin 
290 295 300 

Pro Tyr Asn val Gly Arg Phe Asn val Thr Ala Ala val Gly Gly Tyr 
305 310 315 320 

Lys Ser Glu Ser Ala Val Ala lie Gly Thr Gly Phe Arg Phe Thr Glu 

325 330 335 

Asn Phe Ala Ala Lys Ala Gly Val Ala val Gly Thr Ser Ser Gly Ser 
340 345 350 

Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp 
355 360 

<210> 79 
<211> 427 
<212> PRT 

<213> Neisseria meningitidis 
<400> 79 

Met Phe Glu Arg Ser Val lie Ala Met Ala Cys lie Phe Ala Leu Ser 
15 10 15 

Ala Cys Gly Gly Gly Gly Gly Gly Ser Pro Asp Val Lys Ser Ala Asp 
20 25 30 

Thr Leu Ser Lys Pro Ala Ala Pro val val Ala Glu Lys Glu Thr Glu 
35 40 45 

Val Lys Glu Asp Ala Pro Gin Ala Gly Ser Gin Gly Gin Gly Ala Pro 
50 55 60 

Ser Thr Gin Gly Ser Gin Asp Met Ala Ala val Ser Ala Glu Asn Thr 
65 70 75 80 

Gly Asn Gly Gly Ala Ala Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu 
85 90 95 

Gly Pro Gin Asn Asp Met Pro Gin Asn Ser Ala Glu Ser Ala Asn Gin 
100 105 110 

Thr Gly Asn Asn Gin Pro Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser 
115 120 125 

Asn Pro Ala Pro Ala Asn Gly Gly Ser Asn Phe Gly Arg val Asp Leu 
130 135 140 

Ala Asn Gly Val Leu lie Asp Gly Pro Ser Gin Asn lie Thr Leu Thr 
145 150 155 160 

His Cys Lys Gly Asp Ser Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu 
165 170 175 
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Ala Pro Ser Lys Ser Glu Phe Glu Asn Leu Asn Glu Ser Glu Arg lie 
180 185 190 

Glu Lys Tyr Lys Lys Asp Gly Lys Ser Asp Lys Phe Thr Asn Leu val 
195 200 205 

Ala Thr Ala Val Gin Ala Asn Gly Thr Asn Lys Tyr Val lie lie Tyr 
210 215 220 

Lys Asp Lys Ser Ala Ser ser ser ser Ala Arg Phe Arg Arg ser Ala 
225 230 235 240 

Arg Ser Arg Arg Ser Leu Pro Ala Glu Met Pro Leu lie Pro Val Asn 
245 250 255 

Gin Ala Asp Thr Leu He Val Asp Gly Glu Ala val Ser Leu Thr Gly 
260 265 270 

His Ser Gly Asn lie Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr 
275 280 285 

Tyr Gly Ala Glu Lys Leu Pro Gly Gly Ser Tyr Ala Leu Arg val Gin 
290 295 300 

Gly Glu Pro Ala Lys Gly Glu Met Leu Ala Gly Thr Ala val Tyr Asn 
305 310 315 320 

Gly Glu Val Leu His Phe His Thr Glu Asn Gly Arg Pro Tyr Pro Thr 

325 330 335 

Arg Gly Arg Phe Ala Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp 
340 345 350 

Gly lie lie Asp Ser Gly Asp Asp Leu His Met Gly Thr Gin Lys Phe 
355 360 365 

Lys Ala Ala lie Asp Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn 
370 375 380 

Gly Gly Gly Asp val Ser Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu 
385 390 395 400 

val Ala Gly Lys Tyr Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly 
405 ~ 410 415 

Phe Gly val Phe Ala Gly Lys Lys Glu Gin Asp 
420 ' 425 

<210> 80 
<211> 410 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> 287untagged 
<400> 80 

Cys Gly Gly Gly Gly Gly Gly Ser Pro Asp val Lys Ser Ala Asp Thr 
1 5 10 15 

Leu Ser Lys Pro Ala Ala Pro val val Ala Glu Lys Glu Thr Glu val 
20 25 30 

Lys Glu Asp Ala Pro Gin Ala Gly Ser Gin Gly Gin Gly Ala Pro Ser 
35 40 45 

Thr Gin Gly ser Gin Asp Met Ala Ala val ser Ala Glu Asn Thr Gly 
50 55 60 
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Asn Gly Gly Ala Ala Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly 
65 70 75 80 

Pro Gin Asn Asp Met Pro Gin Asn ser Ala Glu ser Ala Asn Gin Thr 
85 90 95 

Gly Asn Asn Gin Pro Ala Asp Ser Ser Asp Ser Ala Pro Ala Ser Asn 
100 105 110 

Pro Ala Pro Ala Asn Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala 
115 120 125 

Asn Gly val Leu lie Asp Gly Pro Ser Gin Asn lie Thr Leu Thr His 
130 135 140 

Cys Lys Gly Asp Ser Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala 
145 150 155 160 

Pro Ser Lys Ser Glu Phe Glu Asn Leu Asn Glu Ser Glu Arg lie Glu 
165 170 175 

Lys Tyr Lys Lys Asp Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala 
180 185 190 

Thr Ala Val Gin Ala Asn Gly Thr Asn Lys Tyr val lie lie Tyr Lys 
195 200 ' 205 

Asp Lys Ser Ala Ser Ser Ser Ser Ala Arg Phe Arg Arg ser Ala Arg 
210 215 220 

Ser Arg Arg Ser Leu Pro Ala Glu Met Pro Leu lie Pro Val Asn Gin 
225 230 235 240 

Ala Asp Thr Leu lie Val Asp Gly Glu Ala val Ser Leu Thr Gly His 
245 250 255 

Ser Gly Asn lie Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr 
260 265 270 

Gly Ala Glu Lys Leu Pro Gly Gly ser Tyr Ala Leu Arg val Gin Gly 
275 280 285 

Glu Pro Ala Lys Gly Glu Met Leu Ala Gly Thr Ala val Tyr Asn Gly 
290 295 300 

Glu Val Leu His Phe His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg 
305 310 315 320 

Gly Arg Phe Ala Ala Lys Val Asp Phe Gly Ser Lys Ser Val Asp Gly 
325 330 335 

lie lie Asp Ser Gly Asp Asp Leu His Met Gly Thr Gin Lys Phe Lys 
340 345 350 

Ala Ala lie Asp Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly 
355 360 365 

Gly Gly Asp val ser Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu val 
370 375 380 

Ala Gly Lys Tyr Ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe 
385 390 395 400 

Gly Val Phe Ala Gly Lys Lys Glu Gin Asp 



405 



410 



<210> 
<211> 
<212> 



81 
9 

PRT 
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<213> Arti f i ci al Sequence 

<220> 

<223> 920L N-terminal 
<400> 81 

His Arg val Trp val Glu Thr Ala His 



<210> 82 
<211> 16 
<212> PRT 

<213> Artificial Sequence 

<220> 

<223> 953L N-terminal 
<400> 82 

Ala Thr Tyr Lys Val Asp Glu Tyr His Ala Asn Ala Arg Phe Ala Phe 
15 10 15 



<210> 83 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 519. 1L N-terminal 

<400> 83 

Met Glu Phe Phe lie lie Leu Leu Ala 
1 5 

<210> 84 

<211> 488 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> deltaG287 

<400> 84 

Met Phe Lys Arg Ser val 
1 5 

Ala cys Gly Gly Gly Gly 
20 

Thr Leu Ser Lys Pro Ala 
35 

Ala Lys Glu Asp Ala pro 
50 

Ser Ala Gin Gly Ser Gin 
65 70 

Gly Asn Gly Gly Ala val 
85 

val Ala Gin Asn Asp Met 
100 

Thr Pro Asn His Thr Pro 
115 

Asn Gin Ala Thr Asp Ala 
130 



lie Ala Met 

Gly Gly Ser 
25 

Ala Pro val 
40 

Gin Ala Gly 

55 

Asp Met Ala 
Thr Ala Asp 



Pro Gin Asn 
105 

Asp Pro Asn 
120 

Gly Glu Ser 
135 



Ala Cys 
10 

Pro Asp 

val ser 

Ser Gin 

Ala Val 
75 

Asn Pro 
90 

Ala Ala 
Met Leu 
Ser Gin 



lie Phe 

val Lys 

Glu Lys 
45 

Gly Gin 
60 

Ser Glu 
Lys Asn 
Gly Thr 



Ala Gly 
12 5 

Pro Ala 
140 



Ala Leu Ser 
15 

Ser Ala Asp 
30 

Glu Thr Glu 

Gly Ala Pro 

Glu Asn Thr 
80 

Glu Asp Glu 
95 

Asp Ser ser 
110 

Asn Met Glu 
Asn Gin Pro 
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Asp Met Ala Asn Ala Ala Asp Gly Met Gin Gly Asp Asp Pro ser Ala 
145 150 155 160 

Gly Gly Gin Asn Ala Gly Asn Thr Ala Ala Gin Gly Ala Asn Gin Ala 
165 170 175 

Gly Asn Asn Gin Ala Ala Gly Ser Ser Asp Pro He Pro Ala Ser Asn 
180 185 190 

Pro Ala Pro Ala Asn Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala 
195 200 205 

Asn Gly Val Leu lie Asp Gly Pro Ser Gin Asn lie Thr Leu Thr His 
210 215 220 

Cys Lys Gly Asp Ser Cys Ser Gly Asn Asn Phe Leu Asp Glu Glu Val 
225 230 235 240 

Gin Leu Lys ser Glu Phe Glu Lys Leu ser Asp Ala Asp Lys lie Ser 
245 250 255 

Asn Tyr Lys Lys Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala 
260 265 270 

Asp Ser val Gin Met Lys Gly lie Asn Gin Tyr lie lie Phe Tyr Lys 
275 280 285 

Pro Lys Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg 
290 295 300 

Arg Ser Leu Pro Ala Glu Met Pro Leu lie Pro val Asn Gin Ala Asp 
305 310 315 320 

Thr Leu lie val Asp Gly Glu Ala val Ser Leu Thr Gly His Ser Gly 
325 330 335 

Asn lie Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala 
340 345 350 

Glu Lys Leu Pro Gly Gly Ser Tyr Ala Leu Arg val Gin Gly Glu Pro 

3 55 360 " 365 

Ala Lys Gly Glu Met Leu Ala Gly Ala Ala val Tyr Asn Gly Glu val 
370 375 380 

Leu His Phe His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg 
385 390 395 400 

Phe Ala Ala Lys val Asp Phe Gly ser Lys ser val Asp Gly lie lie 
405 410 415 

Asp Ser Gly Asp Asp Leu His Met Gly Thr Gin Lys Phe Lys Ala Ala 
420 425 430 

lie Asp Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Ser Gly 
435 440 445 

Asp val ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu val Ala Gly 
450 455 460 

Lys Tyr Ser Tyr Arg Pro Thr Asp Ala Glu . Lys Gly Gly Phe Gly Val 
465 " 470 475 480 

Phe Ala Gly Lys Lys Glu Gin Asp 



485 



<210> 
<211> 



85 
712 



130 
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<212> 
<213> 


PRT 

Arti f i ci al Sequence 


















<220> 
<223> 


TBP2 
























<400> 
Met Asn 
1 


85 
As n 


Pro 


Leu 

5 


Vd I 


A C t*l 
Ail 1 


Gin 


Ala 


Ala 
10 


Met 


Val 


Leu 


Pro 


val 
15 


Leu 


Leu 


Ser 


Ala 
20 


eys 


Leu 


o i y 


Gly 


Gly 
25 


Gly 


Ser 


Phe 


Asp 


Leu 
30 


Asp 


Val 


Asp 


1 n 1 

35 


o 1 U 


Ala 


Pro 


Arg 


Pro 
40 


Ala 


Pro 


Lys 


Tyr 


Gin 

45 


Asp 


Val 


ser 


Glu 
50 


Lys 


Pro 


u i n 


A 1 d 


ri n 
o 1 n 

55 


Lys 


Asp 


Gin 


Gly 


Gly 
60 


Tyr 


Gly 


Phe 


Met 
65 


Arg 


Leu 


Lys 


Arg 


Arg 
70 


Asn 


Trp 


Tyr 


Pro 


Gin 

75 


Ala 


Lys 


Glu 


Asp 


Val 


Lys 


Leu 


Asp 


r 1 1 1 
G > U 

85 


Se r 


Asp 


Trp 


Glu 


Ala 
90 


Thr 


Gly 


Leu 


Pro 


Asp 
95 


Pro 


Lys 


G 1 U 


Leu 
100 


Pro 


Lys 


Arg 


Gin 


Lys 
105 


Ser 


Val 


He 


Glu 


Lys 
110 


val 


Thr 


ASP 


Ser 
115 


ASp 


Asn 


Asn 


1 1 e 


Tyr 
120 


ser 


Ser 


Pro 


Tyr 


Leu 
125 


Lys 


Pro 


Asn 


His 
130 


G I n 


Asn 


G i y 


Asn 


Th r 

135 


Gly 


Asn 


Gly 


He 


Asn 
140 


Gin 


Pro 


Lys 


Gin 
145 


Ala 


Lys 


ASp 


Tyr 


G 1 U 

150 


Asn 


Phe 


Lys 


Tyr 


val 
155 


Tyr 


ser 


Gly 


Trp 


Tyr 


Lys 


Hi s 


A I a 


Lys 
165 


Arg 


G 1 U 


Phe 


Asn 


Leu 
170 


Lys 


val 


Glu 


Pro 


Lys 
175 


Ala 


Lys 


Asn 


G I y 
180 


Asp 


Asp 


G l y 


Tyr 


He 
185 


Phe 


Tyr 


His 


Gly 


Lys 
190 


Glu 


Ser 


Arg 


\j 1 n 
195 


Leu 


Pro 


a I a 


Ser 


Gly 
200 


Lys 


He 


Thr 


Tyr 


Lys 
205 


Gly 


val 


His 


Phe 
210 


A I a 


Tnr 


Asp 


i n r 


Lys 
215 


Lys 


Gly 


Gin 


Lys 


Phe 

220 


Arg 


Glu 


He 


Gin 

225 


Pro 


Ser 


Lys 


Ser 


G I n 
230 


g i y 


Asp 


Arg 


Tyr 


Ser 
235 


Gly 


Phe 


Ser 


Gly 


Asp 


Gly 


G 1 U 


G 1 U 


Tyr 
245 


Ser 


Asn 


Lys 


Asn 


Lys 
250 


Ser 


Thr 


Leu 


Thr 


Asp 
255 


Gin 


Glu 


Gly 


Tyr 
260 


G i y 


phe 


\ n r 


ser 


Asn 
265 


Leu 


Glu 


val 


ASp 


Phe 
270 


His 


Lys 


Lys 


Leu 

275 


Thr 


Gly 


Lys 


Leu 


lie 

280 


Arg 


Asn 


Asn 


Ala 


Asn 
285 


Thr 


Asp 


Asn 


Gin 
290 


Ala 


Thr 


Thr 


Thr 


Gin 

295 


Tyr 


Tyr 


Ser 


Leu 


Glu 

300 


Ala 


Gin 


val 


Gly 

305 


Asn 


Arg 


Phe 


Asn 


Gly 
310 


Lys 


Ala 


Thr 


Ala 


Thr 
315 


Asp 


Lys 


Pro 


Gin 


Asn 


Ser 


Glu 


Thr 


Lys 
325 


Glu 


His 


Pro 


Phe 


val 

330 


Ser 


Asp 


Ser 


Ser 


Ser 
335 



80 



160 



240 



320 



131 
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ser Gly Gly Phe Phe Gly Pro Gin Gly Glu Glu Leu Gly Phe Arg Phe 
340 345 350 

Leu Ser Asp Asp Gin Lys Val Ala val Val Gly Ser Ala Lys Thr Lys 
355 360 365 

Asp Lys Pro Ala Asn Gly Asn Thr Ala Ala Ala ser Gly Gly Thr Asp 
370 375 380 

Ala Ala Ala Ser Asn Gly Ala Ala Gly Thr Ser Ser Glu Asn Gly Lys 
385 390 395 400 

Leu Thr Thr val Leu Asp Ala Val Glu Leu Lys Leu Gly Asp Lys Glu 
405 410 415 

val Gin Lys Leu Asp Asn Phe Ser Asn Ala Ala Gin Leu val val Asp 
420 425 430 

Gly lie Met lie Pro Leu Leu Pro Glu Ala Ser Glu Ser Gly Asn Asn 
435 440 445 

Gin Ala Asn Gin Gly Thr Asn Gly Gly Thr Ala Phe Thr Arg Lys Phe 
450 455 460 

Asp His Thr Pro Glu Ser Asp Lys Lys Asp Ala Gin Ala Gly Thr Gin 
465 470 475 480 

Thr Asn Gly Ala Gin Thr Ala Ser Asn Thr Ala Gly Asp Thr Asn Gly 
485 490 495 

Lys Thr Lys Thr Tyr Glu Val Glu val Cys Cys Ser Asn Leu Asn Tyr 
500 505 510 

Leu Lys Tyr Gly Met Leu Thr Arg Lys Asn ser Lys Ser Ala Met Gin 
515 520 525 

Ala Gly Glu Ser ser ser Gin Ala Asp Ala Lys Thr Glu Gin val Glu 
530 535 540 

Gin Ser Met Phe Leu Gin Gly Glu Arg Thr Asp Glu Lys Glu lie Pro 
545 550 555 560 

Ser Glu Gin Asn lie Val Tyr Arg Gly Ser Trp Tyr Gly Tyr lie Ala 
565 570 575 

Asn Asp Lys Ser Thr Ser Trp Ser Gly Asn Ala Ser Asn Ala Thr Ser 
580 585 590 

Gly Asn Arg Ala Glu Phe Thr val Asn Phe Ala Asp Lys Lys lie Thr 
595 600 605 

Gly Thr Leu Thr Ala Asp Asn Arg Gin Glu Ala Thr Phe Thr lie Asp 
610 615 620 

Gly Asn lie Lys Asp Asn Gly Phe Glu Gly Thr Ala Lys Thr Ala Glu 
625 630 635 640 

Ser Gly Phe Asp Leu Asp Gin Ser Asn Thr Thr Arg Thr Pro Lys Ala 
645 650 655 

Tyr lie Thr Asp Ala Lys val Gin Gly Gly Phe Tyr Gly Pro Lys Ala 
660 665 670 

Glu Glu Leu Gly Gly Trp Phe Ala Tyr Pro Gly Asp Lys Gin Thr Lys 
675 680 685 

Asn Ala Thr Asn Ala Ser Gly Asn Ser Ser Ala Thr Val Val Phe Gly 
690 695 700 
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Ala Lys Arg Gin Gin Pro Val Arg 
705 710 

<210> 86 

<211> 274 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> 741 

<400> 86 

val Asn Arg Thr Ala Phe cys Cys Leu Ser Leu Thr Thr Ala Leu lie 
1 5 10 15 

Leu Thr Ala Cys Ser Ser Gly Gly Gly Gly Val Ala Ala Asp lie Gly 
20 25 30 

Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro Leu Asp His Lys Asp Lys 
35 40 45 

Gly Leu Gin Ser Leu Thr Leu Asp Gin Ser Val Arg Lys Asn Glu Lys 
50 55 60 

Leu Lys Leu Ala Ala Gin Gly Ala Glu Lys Thr Tyr Gly Asn Gly Asp 
65 70 75 80 

Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp Lys Val Ser Arg Phe Asp 
85 90 95 

Phe lie Arg Gin lie Glu Val Asp Gly Gin Leu lie Thr Leu Glu Ser 
100 105 110 

Gly Glu Phe Gin val Tyr Lys Gin ser His ser Ala Leu Thr Ala Phe 
115 120 125 

Gin Thr Glu Gin lie Gin Asp Ser Glu His Ser Gly Lys Met Val Ala 
130 135 140 

Lys Arg Gin Phe Arg lie Gly Asp lie Ala Gly Glu His Thr Ser Phe 
145 150 155 160 

Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr Tyr Arg Gly Thr Ala Phe 
165 170 175 

Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr Tyr Thr lie Asp Phe Ala 
180 185 190 

Ala Lys Gin Gly Asn Gly Lys lie Glu His Leu Lys ser Pro Glu Leu 
195 200 205 

Asn val Asp Leu Ala Ala Ala Asp lie Lys Pro Asp Gly Lys Arg His 
210 215 220 

Ala Val lie Ser Gly Ser val Leu Tyr Asn Gin Ala Glu Lys Gly Ser 
225 230 235 240 

Tyr Ser Leu Gly lie Phe Gly Gly Lys Ala Gin Glu val Ala Gly Ser 
245 250 255 

Ala Glu Val Lys Thr Val Asn Gly lie Arg His lie Gly Leu Ala Ala 
260 265 270 

Lys Gin 

<210> 87 
<211> 1082 
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EP 



1 790 660 A2 



<212> 
<213> 


PRT 

Arti f i ci al Sequence 






<220> 
<223> 


yoo 














<400> 
Met Arg 
1 


87 
i n r 


i n r 


Pro 

5 


1 n r 


rile 


Pro 


Thr 


Met 


Ala 


Leu 


a i a 
20 


va 1 


Ala 

a i a 


i nr 


Thr 


Leu 

25 


Gly 


Gly 


Gly 

35 


Thr 


Ser 


Ala 


Pro 


Asp 
40 


Phe 


Gly 


ser 
50 


As n 


Se r 


Arg 


Ala 

a i a 


i n r 

55 


Thr 


Ala 


Ala 
65 


Gly 


1 1 e 


Lys 


Asn 


G 1 U 

70 


Met 


Cys 


Lys 


Gly 


Arg 


Asp 


Asp 


va l 
85 


A I a 


va I 


Thr 


Asp 


Pro 


Pro 


Pro 


Asn 
100 


Leu 


Hi s 


Thr 


Gly 


Asp 

105 


Tyr 


Lys 


Asn 
115 


Leu 


1 1 e 


Asn 


Leu 


Lys 
120 


Pro 


Gly 


Arg 
130 


Gly 


Va 1 


Gl U 


va i 


Gl y 
135 


lie 


val 


Ser 
145 


He 


Ser 


Phe 


Pro 


G 1 U 
150 


Leu 


Tyr 


Gly 


Glu 


Asn 


Tyr 


Lys 


Asn 
165 


Tyr 


Thr 


Ala 


Tyr 


Asp 


Gly 


Gly 


G ly 
180 


Lys 


ASp 


1 1 e 


Glu 


Ala 
185 


He 


Glu 


Th r 
195 


G 1 U 


A I a 


Lys 


Pro 


Thr 
200 


Asp 


Gly 


His 
210 


1 1 e 


ASp 


Leu 


va I 


ser 
215 


His 


He 


Gly 

225 


Arg 


Pro 


A la 


G ly 


G I y 

230 


1 1 e 


Ala 


Pro 


Asn 


Thr 


Asn 


ASp 


G 1 U 
245 


Th r 


Lys 


Asn 


Glu 


Asn 


Ala 


Trp 


Val 
260 


Lys 


Leu 


G I y 


Glu 


Arg 
265 


Ser 


Phe 


Gly 

275 


Thr 


Thr 


Ser 


Arg 


Ala 
280 


Gly 


Ala 


Asn 
290 


Ser 


Glu 


Glu 


Gin 


Tyr 
295 


Arg 


Gin 


Gly 
305 


Asp 


Lys 


Thr 


Asp 


Glu 
310 


Gly 


lie 


Arg 


Gly 


Asn 


Leu 


ser 


Tyr 
325 


His 


He 


Arg 


Asn 



10 15 

Ser Ala Cys Leu Gly Gly Gly 
30 

Asn Ala Gly Gly Thr Gly lie 
45 

Lys Ser Ala Ala val ser Tyr 
60 

Asp Arg Ser Met Leu Cys Ala 
75 80 

Arg Asp Ala Lys lie Asn Ala 
90 95 

Phe Pro Asn Pro Asn Asp Ala 
110 

Ala He Glu Ala Gly Tyr Thr 
125 

Asp Thr Gly Glu Ser val Gly 
140 

Arg Lys Glu His Gly Tyr Asn 
155 160 

Met Arg Lys Glu Ala Pro Glu 
170 175 

Ser Phe Asp Asp Glu Ala val 
190 

lie Arg His Val Lys Glu lie 
205 

lie Gly Gly Arg ser val Asp 
220 

Asp Ala Thr Leu His lie Met 
235 240 

Met Met Val Ala Ala lie Arg 

250 255 

Gly val Arg lie val Asn Asn 
270 

Thr Ala Asp Leu Phe Gin lie 

285 

Ala Leu Leu Asp Tyr Ser Gly 
300 

Leu Met Gin Gin Ser Asp Tyr 
315 320 

Lys Asn Met Leu Phe lie Phe 
330 335 
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Ser Thr Gly Asn Asp Ala Gin Ala Gin Pro Asn Thr Tyr Ala Leu Leu 
340 345 350 

Pro Phe Tyr Glu Lys Asp Ala Gin Lys Gly lie lie Thr val Ala Gly 
355 360 365 

Val Asp Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro 
370 375 380 

Gly Thr Glu Pro Leu Glu Tyr Gly ser Asn His cys Gly lie Thr Ala 
385 390 395 400 

Met Trp Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg 
405 ' 410 415 

Thr Asn Pro lie Gin lie Ala Gly Thr ser Phe ser Ala Pro lie val 
420 425 430 

Thr Gly Thr Ala Ala Leu Leu Leu Gin Lys Tyr Pro Trp Met Ser Asn 
435 440 445 

Asp Asn Leu Arg Thr Thr Leu Leu Thr Thr Ala Gin Asp lie Gly Ala 
450 455 460 

val Gly val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys 
465 470 475 480 

Ala Met Asn Gly pro Ala ser Phe pro Phe Gly Asp Phe Thr Ala Asp 
485 490 495 

Thr Lys Gly Thr ser Asp lie Ala Tyr Ser Phe Arg Asn Asp lie Ser 
500 505 510 

Gly Thr Gly Gly Leu lie Lys Lys Gly Gly ser Gin Leu Gin Leu His 
515 520 525 

Gly Asn Asn Thr Tyr Thr Gly Lys Thr lie lie Glu Gly Gly Ser Leu 
530 535 540 

val Leu Tyr Gly Asn Asn Lys ser Asp Met Arg val Glu Thr Lys Gly 
545 550 555 560 

Ala Leu lie Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp 
565 570 575 

Gly lie val Tyr Leu Ala Asp Thr Asp Gin Ser Gly Ala Asn Glu Thr 
580 585 590 

Val His lie Lys Gly Ser Leu Gin Leu Asp Gly Lys Gly Thr Leu Tyr 
595 600 605 

Thr Arg Leu Gly Lys Leu Leu Lys val Asp Gly Thr Ala lie lie Gly 
610 615 620 

Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn 
62 5 630 635 640 

Ser Thr Gly Arg Arg val Pro Phe Leu Ser Ala Ala Lys lie Gly Gin 
645 650 655 

Asp Tyr Ser Phe Phe Thr Asn lie Glu Thr Asp Gly Gly Leu Leu Ala 
660 665 670 

Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu 
675 680 685 

Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala 
690 695 700 
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Ala His ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gin Gly Gly 
705 710 715 720 

Ser Asn Leu Glu Asn Leu Met val Glu Leu Asp Ala Ser Glu Ser Ser 

725 730 735 

Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met 
740 745 750 

Pro Gly lie Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val 
755 760 765 

Gin His Ala Asn Ala Ala Asp Gly val Arg lie Phe Asn Ser Leu Ala 
770 775 780 

Ala Thr val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gin Gly 
785 790 795 800 

Arg Arg Leu Lys Ala val Ser Asp Gly Leu Asp His Asn Gly Thr Gly 
805 810 815 

Leu Arg val lie Ala Gin Thr Gin Gin Asp Gly Gly Thr Trp Glu Gin 
820 825 830 

Gly Gly val Glu Gly Lys Met Arg Gly Ser Thr Gin Thr val Gly lie 
835 840 845 

Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met 
850 855 860 

Gly Arg Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser 
865 870 875 880 

lie Ser Leu Phe Ala Gly lie Arg His Asp Ala Gly Asp lie Gly Tyr 
885 890 895 

Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser lie Ser Arg 
900 905 910 

Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu 
915 920 925 

Met Gin Leu Gly Ala Leu Gly Gly val Asn val pro Phe Ala Ala Thr 
930 935 940 

Gly Asp Leu Thr val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gin 
945 950 955 960 

Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser 
965 970 975 

Leu Thr Glu Gly Thr Leu val Gly Leu Ala Gly Leu Lys Leu ser Gin 
980 985 990 

Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu Arg 
995 1000 1005 

Asp Leu Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala 
1010 1015 1020 

Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg 
1025 1030 1035 1040 

Leu Val Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn 
1045 1050 1055 

Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn His 
1060 1065 1070 



136 



EP 1 790 660 A2 



Ser Gly Arg val Gly Val Gly Tyr Arg Phe 
1075 1080 



10 



15 



20 



25 



30 



35 



40 



45 



50 



88 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 

<400> 
atggctagcc 
gctgaaaaag 
gcgccatcca 
ggcggtgcgg 
ccgcaaaatt 
gattccgccc 
gatttggcta 
aaaggcgatt 
tttgaaaatt 
aaatttacta 
atttataaag 
aggaggtcgc 
gtcgatgggg 
aattaccggt 
gtgcaaggcg 
gtgctgcatt 
aaagtcgatt 
atgggtacgc 
gaaaatggcg 
ggaaaataca 
aaaaaagagc 
ccgcaacccg 
gccggaacga 
ccccactggg 
aatttgaaaa 
cattcctttc 
aacggaagcc 
aggcggacgg 
cccctgcctg 
aacagcggca 
atcaccgcgc 
cacacgcgca 
gccgaagacc 
ccgtccggca 
atcggacgct 
atcaaagcct 
agctatatct 
ggcacgccgt 
gcgcccttat 
atggcgcagg 
tacggcgacg 
ctcctaccca 



88 

2505 
DNA 

Art i f i ci al sequence 



deltaG287-919 



ccgatgttaa 
agacagaggt 
cacaaggcag 
caacaacgga 
ccgccgaatc 
ccgcgtcaaa 
atggcgtttt 
cttgtaatgg 
taaatgagtc 
atttggttgc 
acaagtccgc 
ttcctgccga 
aagcggtcag 
atctgactta 
aaccggcaaa 
ttcatacgga 
tcggcagcaa 
aaaaattcaa 
gcggggatgt 
gctatcgccc 
aggatggatc 
acacatccgt 
cggtcggcgg 
cggcgcagga 
accgccaagg 
aggcaaaaca 
ttgccggtac 
cacaagcccg 
ccggtttgcg 
caatcgacaa 
gcacaacggc 
accaaatcaa 
ccgtcgaact 
aatacatccg 
atatggcgga 
atatgcggca 
ttttccgcga 
tgatggggga 
ttgtcgccac 
ataccggcag 
aagccggcga 
acggtatgaa 



atcggcggac 
aaaagaagat 
ccaagatatg 
caaacccaaa 
cgcaaatcaa 
ccctgcacct 
gattgatggg 
tgataattta 
tgaacgaatt 
gacagcagtt 
ttcatcttca 
gatgccgcta 
cctgacgggg 
cggggcggaa 
aggcgaaatg 
aaacggccgt 
atctgtggac 
agccgccatc 
ttccggaagg 
gacagatgcg 
cggaggagga 
catcaacggc 
cggcggggcc 
tttcgccaaa 
ctggcaggat 
gttttttgaa 
ggttaccggc 
cttcccgatt 
gagcggaaaa 
taccggcggc 
aatcaaaggc 
cggcggcgcg 
tttttttatg 
catcggctat 
caaaggctac 
aaatccgcaa 
gcttgccgga 
atatgccggc 
cgcccatccg 
cgcgattaaa 
acttgccggc 
gcccgaatac 



acgctgtcaa 
gcgccacagg 
gcggcagttt 
aatgaagacg 
acagggaaca 
gcgaatggcg 
ccgtcgcaaa 
ttggatgaag 
gagaaatata 
caagctaatg 
tctgcgcgat 
atccccgtca 
cattccggca 
aaattgcccg 
cttgctggca 
ccgtacccga 
ggcattatcg 
gatggaaacg 
ttttacggcc 
gaaaagggcg 
ggatgccaaa 
ccggaccggc 
gtctataccg 
agcctgcaat 
gtgtgcgccc 
cgctatttca 
tattacgagc 
tacggtattc 
gcccttgtcc 
acacataccg 
aggtttgaag 
cttgacggca 
cacatccaag 
gccgacaaaa 
ctcaagctcg 
cgcctcgccg 
agcagcaatg 
gcagtcgacc 
gttacccgca 
ggcgcggtgc 
aaacagaaaa 
cgcccgtaac 



aaccggccgc 
caggttctca 
cggcagaaaa 
agggaccgca 
accaacccgc 
gtagcaattt 
atataacgtt 
aagcaccgtc 
agaaagatgg 
gaactaacaa 
tcaggcgttc 
atcaggcgga 
atatcttcgc 
gcggatcgta 
cggccgtgta 
ctagaggcag 
acagcggcga 
gctttaaggg 
cggccggcga 
gattcggcgt 
gcaagagcat 
cggtcggcat 
ttgtaccgca 
ccttccgcct 
aagcctttca 
cgccgtggca 
cggtgctgaa 
ccgacgattt 
gcatcaggca 
ccgacctctc 
gaagccgctt 
aagccccgat 
gctcgggccg 
acgaacatcc 
ggcagacctc 
aagttttggg 
acggtcccgt 
ggcactacat 
aagccctcaa 
gcgtggatta 
ccacgggtta 
tcgag 



tcctgttgtt 
aggacagggc 
tacaggcaat 
aaatgatatg 
cgattcttca 
tggaagggtt 
gacccactgt 
aaaatcagaa 
gaaaagcgat 
atatgtcatc 
tgcacggtcg 
tacgctgatt 
gcccgaaggg 
tgccctccgt 
caacggcgaa 
gtttgccgca 
tgatttgcat 
gacttggacg 
ggaagtggcg 
gtttgccggc 
ccaaaccttt 
ccccgacccc 
cctgtccctg 
cggctgcgcc 
aacccccgtc 
ggttgcaggc 
gggcgacgac 
tatctccgtc 
gacgggaaaa 
ccgattcccc 
cctcccctac 
actcggttac 
tctgaaaacc 
ctacgtttcc 
gatgcagggc 
tcaaaacccc 
cggcgcactg 
taccttgggc 
ccgcctgatt 
tttttgggga 
cgtctggcag 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



89 

832 

PRT 

Arti f i ci al Sequence 
deltaG287-919 



55 



<400> 89 

Met Ala ser Pro Asp Val Lys Ser Ala Asp Thr Leu Ser Lys Pro Ala 
1 5 10 15 

Ala pro val val Ala Glu Lys Glu Thr Glu val Lys Glu Asp Ala Pro 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2505 
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20 25 30 

Gin Ala Gly Ser Gin Gly Gin Gly Ala Pro Ser Thr Gin Gly Ser Gin 
35 40 45 

Asp Met Ala Ala val ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala 
50 55 60 

Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gin Asn Asp Met 
65 70 75 80 

Pro Gin Asn Ser Ala Glu Ser Ala Asn Gin Thr Gly Asn Asn Gin Pro 
85 90 95 

Ala Asp ser Ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn 
100 105 110 

Gly Gly Ser Asn Phe Gly Arg val Asp Leu Ala Asn Gly Val Leu lie 
115 120 125 

Asp Gly Pro Ser Gin Asn lie Thr Leu Thr His Cys Lys Gly Asp Ser 
130 135 140 

Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro Ser Lys Ser Glu 
145 150 155 160 

Phe Glu Asn Leu Asn Glu Ser Glu Arg lie Glu Lys Tyr Lys Lys Asp 
165 170 175 

Gly Lys Ser Asp Lys Phe Thr Asn Leu val Ala Thr Ala val Gin Ala 
180 185 190 

Asn Gly Thr Asn Lys Tyr val lie lie Tyr Lys Asp Lys Ser Ala Ser 
195 200 205 

Ser Ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu 
210 ^ 215 220 

Pro Ala Glu Met Pro Leu lie Pro val Asn Gin Ala Asp Thr Leu lie 

225 230 235 240 

val Asp Gly Glu Ala val ser Leu Thr Gly His ser Gly Asn lie Phe 

245 250 255 

Ala pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu 
260 265 270 

Pro Gly Gly ser Tyr Ala Leu Arg val Gin Gly Glu pro Ala Lys Gly 
275 280 285 

Glu Met Leu Ala Gly Thr Ala val Tyr Asn Gly Glu val Leu His Phe 
290 295 300 

His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala 
305 310 315 320 

Lys val Asp Phe Gly ser Lys ser val Asp Gly lie lie Asp ser Gly 
325 330 335 

Asp Asp Leu His Met Gly Thr Gin Lys Phe Lys Ala Ala lie Asp Gly 
340 345 350 

Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser 
355 360 365 

Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr Ser 
370 375 380 

Tyr Arg pro Thr Asp Ala Glu Lys Gly Gly phe Gly Val Phe Ala Gly 
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385 390 395 400 

Lys Lys Glu Gin Asp Gly Ser Gly Gly Gly Gly Cys Gin ser Lys Ser 
405 410 415 

lie Gin Thr Phe Pro Gin Pro Asp Thr ser val lie Asn Gly Pro Asp 
420 425 430 

Arg Pro Val Gly lie Pro Asp Pro Ala Gly Thr Thr val Gly Gly Gly 
435 440 445 

Gly Ala val Tyr Thr val val Pro His Leu Ser Leu Pro His Trp Ala 
450 455 460 

Ala Gin Asp Phe Ala Lys Ser Leu Gin Ser Phe Arg Leu Gly Cys Ala 
465 470 475 480 

Asn Leu Lys Asn Arg Gin Gly Trp Gin Asp val cys Ala Gin Ala Phe 
485 490 495 

Gin Thr Pro val His Ser Phe Gin Ala Lys Gin Phe Phe Glu Arg Tyr 
500 505 510 

Phe Thr Pro Trp Gin val Ala Gly Asn Gly Ser Leu Ala Gly Thr Val 

515 520 525 

Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp Arg Arg Thr Ala 
530 53 5 540 

Gin Ala Arg Phe Pro lie Tyr Gly lie Pro Asp Asp Phe lie Ser val 
545 550 555 560 

Pro Leu Pro Ala Gly Leu Arg ser Gly Lys Ala Leu val Arg lie Arg 
565 570 575 

Gin Thr Gly Lys Asn Ser Gly Thr lie Asp Asn Thr Gly Gly Thr His 
580 585 590 

Thr Ala Asp Leu Ser Arg Phe Pro lie Thr Ala Arg Thr Thr Ala lie 
595 600 605 

Lys Gly Arg Phe Glu Gly ser Arg Phe Leu Pro Tyr His Thr Arg Asn 
610 615 620 

Gin lie Asn Gly Gly Ala Leu Asp Gly Lys Ala Pro lie Leu Gly Tyr 
625 630 635 640 

Ala Glu Asp Pro Val Glu Leu Phe Phe Met His lie Gin Gly Ser Gly 
645 650 655 

Arg Leu Lys Thr Pro ser Gly Lys Tyr lie Arg lie Gly Tyr Ala Asp 
660 665 670 

Lys Asn Glu His Pro Tyr Val Ser lie Gly Arg Tyr Met Ala Asp Lys 
675 680 685 

Gly Tyr Leu Lys Leu Gly Gin Thr Ser Met Gin Gly lie Lys Ala Tyr 
690 695 700 

Met Arg Gin Asn Pro Gin Arg Leu Ala Glu val Leu Gly Gin Asn Pro 
705 ~ 710 715 720 

Ser Tyr lie Phe Phe Arg Glu Leu Ala Gly Ser Ser Asn Asp Gly Pro 
725 " 730 735 

val Gly Ala Leu Gly Thr pro Leu Met Gly Glu Tyr Ala Gly Ala val 
740 745 750 

Asp Arg His Tyr lie Thr Leu Gly Ala Pro Leu Phe val Ala Thr Ala 
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755 



760 



765 



His Pro Val Thr Arg i_ys Ala Leu Asn Arg Leu lie Met Ala Gin Asp 
770 775 780 

Thr Gly Ser Ala lie Lys Gly Ala val Arg Val Asp Tyr Phe Trp Gly 

785 790 795 800 

Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gin Lys Thr Thr Gly 
805 810 815 

Tyr Val Trp Gin Leu Leu Pro Asn Gly Met Lys Pro Glu Tyr Arg Pro 
820 825 830 



<210> 90 

<211> 1746 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



deltaG287-953 



<400> 90 
atggctagcc 
gctgaaaaag 
gcgccatcca 
ggcggtgcgg 
ccgcaaaatt 
gattccgccc 
gatttggcta 
aaaggcgatt 
tttgaaaatt 
aaatttacta 
atttataaag 
aggaggtcgc 
gtcgatgggg 
aattaccggt 
gtgcaaggcg 
gtgctgcatt 
aaagtcgatt 
atgggtacgc 
gaaaatggcg 
ggaaaataca 
aaaaaagagc 
gccaacgccc 
ggtctgaccg 
atccccgttg 
gacatcttcg 
aacggcaaaa 
gtcaaactca 
tgcggcggcg 
aacgttggta 
ctcgag 



ccgatgttaa 
agacagaggt 
cacaaggcag 
caacaacgga 
ccgccgaatc 
ccgcgtcaaa 
atggcgtttt 
cttgtaatgg 
taaatgagtc 
atttggttgc 
acaagtccgc 
ttcctgccga 
aagcggtcag 
atctgactta 
aaccggcaaa 
ttcatacgga 
tcggcagcaa 
aaaaattcaa 
gcggggatgt 
gctatcgccc 
aggatggatc 
gtttcgccat 
gttccgtcga 
ccaacctgca 
atgccgccca 
aactggtttc 
aagccgaaaa 
acttcagcac 
tgaccaaaag 



atcggcggac 
aaaagaagat 
ccaagatatg 
caaacccaaa 
cgcaaatcaa 
ccctgcacct 
gattgatggg 
tgataattta 
tgaacgaatt 
gacagcagtt 
ttcatcttca 
gatgccgcta 
cctgacgggg 
cggggcggaa 
aggcgaaatg 
aaacggccgt 
atctgtggac 
agccgccatc 
ttccggaagg 
gacagatgcg 
cggaggagga 
cgaccatttc 
gttcgaccaa 
aagcggttcg 
atatccggac 
cgttgacggc 
attcaactgc 
caccatcgac 
cgtccgcatc 



acgctgtcaa 
gcgccacagg 
gcggcagttt 
aatgaagacg 
acagggaaca 
gcgaatggcg 
ccgtcgcaaa 
ttggatgaag 
gagaaatata 
caagctaatg 
tctgcgcgat 
atccccgtca 
cattccggca 
aaattgcccg 
cttgctggca 
ccgtacccga 
ggcattatcg 
gatggaaacg 
ttttacggcc 
gaaaagggcg 
ggagccacct 
aacaccagca 
gcaaaacgcg 
caacacttta 
atccgctttg 
aacctgacca 
taccaaagcc 
cgcaccaaat 
gacatccaaa 



aaccggccgc 
caggttctca 
cggcagaaaa 
agggaccgca 
accaacccgc 
gtagcaattt 
atataacgtt 
aagcaccgtc 
agaaagatgg 
gaactaacaa 
tcaggcgttc 
atcaggcgga 
atatcttcgc 
gcggatcgta 
cggccgtgta 
ctagaggcag 
acagcggcga 
gctttaaggg 
cggccggcga 
gattcggcgt 
acaaagtgga 
ccaacgtcgg 
acggtaaaat 
ccgaccacct 
tttccaccaa 
tgcacggcaa 
cgatggcgaa 
ggggcgtgga 
tcgaggcagc 



tcctgttgtt 
aggacagggc 
tacaggcaat 
aaatgatatg 
cgattcttca 
tggaagggtt 
gacccactgt 
aaaatcagaa 
gaaaagcgat 
atatgtcatc 
tgcacggtcg 
tacgctgatt 
gcccgaaggg 
tgccctccgt 
caacggcgaa 
gtttgccgca 
tgatttgcat 
gacttggacg 
ggaagtggcg 
gtttgccggc 
cgaatatcac 
cggtttttac 
cgacatcacc 
gaaatcagcc 
attcaacttc 
aaccgccccc 
aaccgaagtt 
ctacctcgtt 
caaacaataa 



<210> 91 

<211> 579 

<212> PRT 

<213> Artificial Sequence 



<220> 
<223> 



deltaG287-953 



<400> 91 

Met Ala Ser Pro Asp val Lys ser Ala Asp Thr Leu ser Lys Pro Ala 
1 5 10 15 

Ala Pro val val Ala Glu Lys Glu Thr Glu val Lys Glu Asp Ala Pro 
20 25 30 
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Gin Ala Gly 
35 

Asp Met Ala 
50 

Thr Thr Asp 
65 

pro Gin Asn 

Ala Asp ser 

Gly Gly ser 
115 

Asp Gly Pro 
130 

Cys Asn Gly 
145 

Phe Glu Asn 

Gly Lys Ser 

Asn Gly Thr 
195 

Ser Ser Ser 
210 

Pro Ala Glu 

225 

val Asp Gly 
Ala Pro Glu 

Pro Gly Gly 

275 

Glu Met Leu 
290 

His Thr Glu 

305 

Lys val Asp 
Asp Asp Leu 



Asn Gly Phe 
355 

Gly Arg Phe 

370 

Tyr Arg pro 
385 



Ser Gin Gly Gin 
Ala val 
Lys Pro 



Ser Ala 
55 



ser Ala 
85 

ser Asp 
100 

Asn Phe 

Ser Gin 

Asp Asn 

Leu Asn 
165 

Asp Lys 
180 

Asn Lys 

Ala Arg 

Met Pro 

Glu Ala 
245 

Gly Asn 
260 

Ser Tyr 
Ala Gly 
Asn Gly 

Phe Gly 

325 

His Met 
340 

Lys Gly 
Tyr Gly 
Thr Asp 



Lys Asn 
70 

Glu ser 

ser Ala 

Gly Arg 

Asn lie 
135 

Leu Leu 
150 

Glu Ser 

Phe Thr 

Tyr val 

Phe Arg 
215 

Leu lie 
230 

val ser 

Tyr Arg 

Ala Leu 

Thr Ala 
295 

Arg Pro 
310 

Ser Lys 
Gly Thr 
Thr Trp 



pro Ala 
375 

Ala Glu 
390 



Gly Ala 
40 

Glu Asn 
Glu ASp 

Ala Asn 

pro Ala 
105 

val Asp 
120 

Thr Leu 
Asp Glu 
Glu Arg 

Asn Leu 

185 

lie lie 

200 

Arg Ser 
Pro Val 
Leu Thr 

Tyr Leu 

265 

Arg Val 
280 

val Tyr 
Tyr Pro 
Ser Val 

Gin Lys 

345 

Thr Glu 
360 

Gly Glu 
Lys Gly 



Pro Ser Thr 

Thr Gly Asn 
60 

Glu Gly Pro 
75 

Gin Thr Gly 
90 

Ser Asn Pro 

Leu Ala Asn 

Thr His Cys 
140 

Glu Ala Pro 
155 

lie Glu Lys 
170 

val Ala Thr 

Tyr Lys Asp 

Ala Arg Ser 
220 

Asn Gin Ala 
235 

Gly His ser 
250 

Thr Tyr Gly 

Gin Gly Glu 

Asn Gly Glu 
300 

Thr Arg Gly 
315 

Asp Gly lie 
330 

Phe Lys Ala 
Asn Gly Gly 



Glu Val Ala 
380 

Gly Phe Gly 
395 



Gin Gly 
45 

Gly Gly 

Gin Asn 

Asn Asn 

Ala Pro 
110 

Gly val 
125 

Lys Gly 

Ser Lys 

Tyr Lys 

Ala val 
190 

Lys Ser 
205 

Arg Arg 
Asp Thr 
Gly Asn 



Ser Gin 

Ala Ala 

Asp Met 
80 

Gin Pro 
95 

Ala Asn 

Leu lie 

Asp ser 

Ser Glu 
160 

Lys Asp 
175 

Gin Ala 
Ala Ser 
Ser Leu 



Leu lie 
240 

lie Phe 
255 



Ala Glu 
270 



Lys Leu 
Lys Gly 
Val Leu His Phe 



Pro Ala 
285 



Arg Phe Ala Ala 
320 

lie Asp Ser Gly 

335 

Ala lie Asp Gly 

3 50 

Gly Asp val Ser 
365 

Gly Lys Tyr ser 



val Phe Ala Gly 
400 
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Lys Lys Glu 

Asp Glu Tyr 

Ser Thr Asn 
435 

Asp Gin Ala 
450 

Asn Leu Gin 
465 

Asp lie Phe 

Lys Phe Asn 

Thr Met His 
515 

Asn Cys Tyr 
530 

Phe Ser Thr 
545 

Asn Val Gly 
Ala Lys Gin 



Gin Asp 
405 

His Ala 
420 



Gly Ser 
Asn Ala 
Gly Phe 



Gly Gly Gly Gly 
410 

Arg Phe Ala lie 
425 



Tyr Gly Leu Thr 
440 



Val Gly 

Lys Arg Asp Gly Lys lie Asp lie 
455 

Ser Gly 



45 



Asp Ala 
485 

Phe Asn 
500 

Gly Lys 
Gin Ser 
Thr lie 



Met Thr 
565 



Ser Gin 
470 

Ala Gin 
Gly Lys 
Thr Ala 



His Phe 
Tyr Pro 



Lys Leu 
505 

Pro val 
520 



Pro Met 
535 



Ala Lys 
Thr Lys 
Lys Ser val Arg 



Asp Arg 
550 



Thr Asp 
475 

Asp lie 
490 

val ser 
Lys Leu 
Thr Glu 



Trp Gly 

555 

lie Asp 
570 



Ala Thr 

Asp His 

Gly Ser 
445 

Thr lie 
460 

His Leu 

Arg Phe 

val Asp 

Lys Ala 
525 

Val Cys 
540 

Val Asp 
He Gin 



Tyr Lys Val 
415 

Phe Asn Thr 
430 

val Glu Phe 

Pro val Ala 

Lys Ser Ala 
480 

Val Ser Thr 
495 

Gly Asn Leu 
510 

Glu Lys Phe 

Gly Gly Asp 

Tyr Leu val 
560 

lie Glu Ala 

575 



<210> 
<211> 
<212> 
<213> 

<220> 
<223> 



92 

2388 
DNA 

Arti f i ci al Sequence 



deltaG287-961 



<400> 92 
atggctagcc 
gctgaaaaag 
gcgccatcca 
ggcggtgcgg 
ccgcaaaatt 
gattccgccc 
gatttggcta 
aaaggcgatt 
tttgaaaatt 
aaatttacta 
atttataaag 
aggaggtcgc 
gtcgatgggg 
aattaccggt 
gtgcaaggcg 
gtgctgcatt 
aaagtcgatt 
atgggtacgc 
gaaaatggcg 
ggaaaataca 
aaaaaagagc 
gctgccactg 
gctggagaga 



ccgatgttaa 
agacagaggt 
cacaaggcag 
caacaacgga 
ccgccgaatc 
ccgcgtcaaa 
atggcgtttt 
cttgtaatgg 
taaatgagtc 
atttggttgc 
acaagtccgc 
ttcctgccga 
aagcggtcag 
atctgactta 
aaccggcaaa 
ttcatacgga 
tcggcagcaa 
aaaaattcaa 
gcggggatgt 
gctatcgccc 
aggatggatc 
tggccattgc 
ccatctacga 



atcggcggac 
aaaagaagat 
ccaagatatg 
caaacccaaa 
cgcaaatcaa 
ccctgcacct 
gattgatggg 
tgataattta 
tgaacgaatt 
gacagcagtt 
ttcatcttca 
gatgccgcta 
cctgacgggg 
cggggcggaa 
aggcgaaatg 
aaacggccgt 
atctgtggac 
agccgccatc 
ttccggaagg 
gacagatgcg 
cggaggagga 
tgctgcctac 
cattgatgaa 



acgctgtcaa 
gcgccacagg 
gcggcagttt 
aatgaagacg 
acagggaaca 
gcgaatggcg 
ccgtcgcaaa 
ttggatgaag 
gagaaatata 
caagctaatg 
tctgcgcgat 
atccccgtca 
cattccggca 
aaattgcccg 
cttgctggca 
ccgtacccga 
ggcattatcg 
gatggaaacg 
ttttacggcc 
gaaaagggcg 
ggagccacaa 
aacaatggcc 
gacggcacaa 



aaccggccgc 
caggttctca 
cggcagaaaa 
agggaccgca 
accaacccgc 
gtagcaattt 
atataacgtt 
aagcaccgtc 
agaaagatgg 
gaactaacaa 
tcaggcgttc 
atcaggcgga 
atatcttcgc 
gcggatcgta 
cggccgtgta 
ctagaggcag 
acagcggcga 
gctttaaggg 
cggccggcga 
gattcggcgt 
acgacgacga 
aagaaatcaa 
ttaccaaaaa 



tcctgttgtt 
aggacagggc 
tacaggcaat 
aaatgatatg 
cgattcttca 
tggaagggtt 
gacccactgt 
aaaatcagaa 
gaaaagcgat 
atatgtcatc 
tgcacggtcg 
tacgctgatt 
gcccgaaggg 
tgccctccgt 
caacggcgaa 
gtttgccgca 
tgatttgcat 
gacttggacg 
ggaagtggcg 
gtttgccggc 
tgttaaaaaa 
cggtttcaaa 
agacgcaact 
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gcagccgatg ttgaagccga cgactttaaa ggtctgggtc tgaaaaaagt cgtgactaac 1440 

ctgaccaaaa ccgtcaatga aaacaaacaa aacgtcgatg ccaaagtaaa agctgcagaa 1500 

tctgaaatag aaaagttaac aaccaagtta gcagacactg atgccgcttt agcagatact 1560 

gatgccgctc tggatgcaac caccaacgcc ttgaataaat tgggagaaaa tataacgaca 1620 

tttgctgaag agactaagac aaatatcgta aaaattgatg aaaaattaga agccgtggct 1680 

gataccgtcg acaagcatgc cgaagcattc aacgatatcg ccgattcatt ggatgaaacc 1740 

aacactaagg cagacgaagc cgtcaaaacc gccaatgaag ccaaacagac ggccgaagaa 1800 

accaaacaaa acgtcgatgc caaagtaaaa gctgcagaaa ctgcagcagg caaagccgaa 1860 

gctgccgctg gcacagctaa tactgcagcc gacaaggccg aagctgtcgc tgcaaaagtt 1920 

accgacatca aagctgatat cgctacgaac aaagataata ttgctaaaaa agcaaacagt 1980 

gccgacgtgt acaccagaga agagtctgac agcaaatttg tcagaattga tggtctgaac 2040 

gctactaccg aaaaattgga cacacgcttg gcttctgctg aaaaatccat tgccgatcac 2100 

gatactcgcc tgaacggttt ggataaaaca gtgtcagacc tgcgcaaaga aacccgccaa 2160 

ggccttgcag aacaagccgc gctctccggt ctgttccaac cttacaacgt gggtcggttc 2220 

aatgtaacgg ctgcagtcgg cggctacaaa tccgaatcgg cagtcgccat cggtaccggc 2280 

ttccgcttta ccgaaaactt tgccgccaaa gcaggcgtgg cagtcggcac ttcgtccggt 2340 

tcttccgcag cctaccatgt cggcgtcaat tacgagtggt aactcgag 2388 

<210> 93 

<211> 793 

<212> PRT 

<213> Artificial Sequence 

20 <220> 

<223> deltaG287-961 

<400> 93 

Met Ala ser Pro Asp val Lys ser Ala Asp Thr Leu Ser Lys Pro Ala 
1 5 10 15 

25 

Ala Pro val val Ala Glu Lys Glu Thr Glu val Lys Glu Asp Ala Pro 
20 25 30 

Gin Ala Gly Ser Gin Gly Gin Gly Ala Pro Ser Thr Gin Gly Ser Gin 
35 40 45 

Asp Met Ala Ala Val Ser Ala Glu Asn Thr Gly Asn Gly Gly Ala Ala 
50 55 60 

Thr Thr Asp Lys Pro Lys Asn Glu Asp Glu Gly Pro Gin Asn Asp Met 
65 70 75 80 

Pro Gin Asn Ser Ala Glu Ser Ala Asn Gin Thr Gly Asn Asn Gin Pro 
85 90 95 

Ala Asp Ser ser Asp Ser Ala Pro Ala Ser Asn Pro Ala Pro Ala Asn 
100 105 110 

Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala Asn Gly Val Leu lie 
115 120 125 

Asp Gly Pro Ser Gin Asn lie Thr Leu Thr His cys Lys Gly Asp Ser 
130 135 140 

45 Cys Asn Gly Asp Asn Leu Leu Asp Glu Glu Ala Pro ser Lys ser Glu 

145 150 155 160 

Phe Glu Asn Leu Asn Glu Ser Glu Arg lie Glu Lys Tyr Lys Lys Asp 
165 170 175 



30 



35 



40 



50 



55 



Gly Lys Ser Asp Lys Phe Thr Asn Leu Val Ala Thr Ala Val Gin Ala 
180 185 190 

Asn Gly Thr Asn Lys Tyr Val lie lie Tyr Lys Asp Lys Ser Ala Ser 
195 200 205 

Ser ser Ser Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser Leu 
210 215 220 
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Pro Ala Glu Met Pro Leu lie Pro Val Asn Gin Ala Asp Thr Leu He 
225 230 235 240 

Val Asp Gly Glu Ala val Ser Leu Thr Gly His Ser Gly Asn lie Phe 
245 250 255 

Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys Leu 
260 265 270 

Pro Gly Gly Ser Tyr Ala Leu Arg val Gin Gly Glu Pro Ala Lys Gly 
275 280 285 

Glu Met Leu Ala Gly Thr Ala Val Tyr Asn Gly Glu Val Leu His Phe 
290 295 300 

His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg Phe Ala Ala 
305 310 315 320 

Lys val Asp Phe Gly ser Lys ser val Asp Gly He lie Asp Ser Gly 
325 330 335 

Asp Asp Leu His Met Gly Thr Gin Lys Phe Lys Ala Ala lie Asp Gly 
340 345 350 

Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val Ser 
355 360 365 

Gly Arg Phe Tyr Gly Pro Ala Gly Glu Glu val Ala Gly Lys Tyr Ser 
370 375 380 

Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala Gly 
385 390 395 400 

Lys Lys Glu Gin Asp Gly Ser Gly Gly Gly Gly Ala Thr Asn Asp Asp 
405 410 415 

Asp Val Lys Lys Ala Ala Thr val Ala lie Ala Ala Ala Tyr Asn Asn 
420 425 430 

Gly Gin Glu lie Asn Gly Phe Lys Ala Gly Glu Thr lie Tyr Asp lie 
435 440 445 

Asp Glu Asp Gly Thr lie Thr Lys Lys Asp Ala Thr Ala Ala Asp Val 
450 455 460 

Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn 
465 470 475 480 

Leu Thr Lys Thr val Asn Glu Asn Lys Gin Asn Val Asp Ala Lys Val 
485 490 495 

Lys Ala Ala Glu Ser Glu lie Glu Lys Leu Thr Thr Lys Leu Ala Asp 
500 505 510 

Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr 
515 520 525 

Asn Ala Leu Asn Lys Leu Gly Glu Asn lie Thr Thr Phe Ala Glu Glu 
530 535 540 

Thr Lys Thr Asn lie val Lys lie Asp Glu Lys Leu Glu Ala val Ala 
545 550 555 560 

Asp Thr val Asp Lys His Ala Glu Ala Phe Asn Asp lie Ala Asp Ser 
565 570 575 

Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala val Lys Thr Ala Asn 
580 585 590 
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Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys Gin Asn Val Asp Ala Lys 
595 600 605 

val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly 

610 615 620 

Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys Val 
625 630 635 640 

Thr Asp lie Lys Ala Asp lie Ala Thr Asn Lys Asp Asn lie Ala Lys 
645 650 655 

Lys Ala Asn Ser Ala Asp Val Tyr Thr Arg Glu Glu Ser Asp Ser Lys 
660 665 670 

phe val Arg lie Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr 
675 680 685 

Arg Leu Ala Ser Ala Glu Lys Ser lie Ala Asp His Asp Thr Arg Leu 

690 695 700 

Asn Gly Leu Asp Lys Thr Val Ser Asp Leu Arg Lys Glu Thr Arg Gin 
705 710 715 720 

Gly Leu Ala Glu Gin Ala Ala Leu Ser Gly Leu Phe Gin Pro Tyr Asn 
725 730 735 

val Gly Arg Phe Asn val Thr Ala Ala val Gly Gly Tyr Lys ser Glu 
740 745 750 

Ser Ala Val Ala lie Gly Thr Gly Phe Arg Phe Thr Glu Asn Phe Ala 
755 ' 760 765 

Ala Lys Ala Gly val Ala Val Gly Thr Ser Ser Gly Ser Ser Ala Ala 

770 775 780 

Tyr His val Gly val Asn Tyr Glu Trp 
785 790 

<210> 94 
<211> 2700 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> deltaG287NZ-919 
<400> 94 

atggctagcc ccgatgtcaa gtcggcggac acgctgtcaa aacctgccgc ccctgttgtt 

tctgaaaaag agacagaggc aaaggaagat gcgccacagg caggttctca aggacagggc 

gcgccatccg cacaaggcgg tcaagatatg gcggcggttt cggaagaaaa tacaggcaat 

ggcggtgcgg cagcaacgga caaacccaaa aatgaagacg agggggcgca aaatgatatg 

ccgcaaaatg ccgccgatac agatagtttg acaccgaatc acaccccggc ttcgaatatg 

ccggccggaa atatggaaaa ccaagcaccg gatgccgggg aatcggagca gccggcaaac 

caaccggata tggcaaatac ggcggacgga atgcagggtg acgatccgtc ggcaggcggg 

gaaaatgccg gcaatacggc tgcccaaggt acaaatcaag ccgaaaacaa tcaaaccgcc 

ggttctcaaa atcctgcctc ttcaaccaat cctagcgcca cgaatagcgg tggtgatttt 

ggaaggacga acgtgggcaa ttctgttgtg attgacgggc cgtcgcaaaa tataacgttg 

acccactgta aaggcgattc ttgtagtggc aataatttct tggatgaaga agtacagcta 

aaatcagaat ttgaaaaatt aagtgatgca gacaaaataa gtaattacaa gaaagatggg 

aagaatgacg ggaagaatga taaatttgtc ggtttggttg ccgatagtgt gcagatgaag 

ggaatcaatc aatatattat cttttataaa cctaaaccca cttcatttgc gcgatttagg 

cgttctgcac ggtcgaggcg gtcgcttccg gccgagatgc cgctgattcc cgtcaatcag 

gcggatacgc tgattgtcga tggggaagcg gtcagcctga cggggcattc cggcaatatc 

ttcgcgcccg aagggaatta ccggtatctg acttacgggg cggaaaaatt gcccggcgga 

tcgtatgccc tccgtgttca aggcgaacct tcaaaaggcg aaatgctcgc gggcacggca 

gtgtacaacg gcgaagtgct gcattttcat acggaaaacg gccgtccgtc cccgtccaga 

ggcaggtttg ccgcaaaagt cgatttcggc agcaaatctg tggacggcat tatcgacagc 

ggcgatggtt tgcatatggg tacgcaaaaa ttcaaagccg ccatcgatgg aaacggcttt 
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Gly Pro 

Ser Gly 
210 

Glu Lys 

225 

Lys Asn 

val Gin 

pro Thr 

Leu Pro 
290 

lie val 
305 

Phe Ala 

Leu Pro 

Gly Glu 

Phe His 
370 

Ala Lys 
385 

Gly Asp 

Gly Asn 

Ser Gly 

Ser Tyr 
450 

Gly Lys 
465 

Ser lie 
Asp Arg 
Gly Gly 



Ala Ala 
530 

Ala Asn 
545 



Ser Gin Asn 
195 

Asn Asn Phe 

Leu Ser Asp 

Asp Gly Lys 
245 

Met Lys Gly 
260 

Ser Phe Ala 
275 

Ala Glu Met 

Asp Gly Glu 

Pro Glu Gly 
325 

Gly Gly Ser 
340 

Met Leu Ala 

355 

Thr Glu Asn 

val Asp Phe 

Gly Leu His 
405 

Gly Phe Lys 
420 

Lys Phe Tyr 
435 

Arg Pro Thr 

Lys Glu Gin 

Gin Thr Phe 
485 

pro val Gly 
500 

Ala val Tyr 
515 

Gin Asp Phe 
Leu Lys Asn 



lie Thr 

Leu Asp 
215 

Ala Asp 
230 

Asn Asp 

lie Asn 

Arg Phe 

Pro Leu 
295 

Ala Val 
310 

Asn Tyr 

Tyr Ala 

Gly Thr 

Gly Arg 
375 

Gly Ser 
390 

Met Gly 

Gly Thr 

Gly Pro 

Asp Ala 
455 

Asp Gly 
470 

Pro Gin 
lie Pro 
Thr val 



Ala Lys 
535 

Arg Gin 
550 



Leu Thr 
200 

Glu Glu 
Lys lie 
Lys Phe 



Gin Tyr 

265 

Arg Arg 
280 

lie Pro 



Ser Leu 
Arg Tyr 



Leu Arg 
345 

Ala val 
360 



His Cys 

Val Gin 

Ser Asn 
235 

Val Gly 
250 

lie lie 

Ser Ala 

Val Asn 

Thr Gly 
315 

Leu Thr 
330 

Val Gin 
Tyr Asn 



Pro Ser Pro Ser 



Lys Ser 
Thr Gin 

Trp Thr 

425 

Ala Gly 
440 

Glu Lys 
Ser Gly 
Pro Asp 



val Asp 
395 

Lys Phe 
410 

Glu Asn 
Glu Glu 
Gly Gly 



Gly Gly 
475 

Thr Ser 
490 



Ala Gly 
His Leu 
Ser Leu Gin ser 



Asp Pro 
505 

val Pro 
520 



Gly Trp 



Gin Asp 
555 



Lys Gly 
205 

Leu Lys 
220 

Tyr Lys 

Leu Val 

Phe Tyr 

Arg Ser 
285 

Gin Ala 
300 

His Ser 

Tyr Gly 

Gly Glu 

Gly Glu 
365 

Arg Gly 
380 

Gly lie 

Lys Ala 

Gly Gly 

val Ala 
445 

Phe Gly 
460 

Gly Cys 
val lie 
Thr Thr 



Ser Leu 
525 

Phe Arg 
540 

val Cys 



Asp Ser Cys 

ser Glu Phe 

Lys Asp Gly 
240 

Ala Asp Ser 
255 

Lys Pro Lys 
270 

Arg Arg Ser 
Asp Thr Leu 

Gly Asn lie 

320 

Ala Glu Lys 
335 

Pro Ser Lys 
350 

val Leu His 

Arg Phe Ala 

lie Asp Ser 
400 

Ala lie Asp 
415 

Gly Asp val 
430 

Gly Lys Tyr 

val Phe Ala 

Gin ser Lys 
480 

Asn Gly Pro 
495 

val Gly Gly 
510 

pro His Trp 
Leu Gly Cys 



Ala Gin Ala 
560 
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Phe Gin Thr Pro Val His Ser Phe Gin Ala Lys Gin Phe Phe Glu Arg 
565 570 575 

Tyr Phe Thr Pro Trp Gin Val Ala Gly Asn Gly Ser Leu Ala Gly Thr 
580 585 590 

Val Thr Gly Tyr Tyr Glu Pro Val Leu Lys Gly Asp Asp Arg Arg Thr 
595 600 605 

Ala Gin Ala Arg Phe Pro lie Tyr Gly lie Pro Asp Asp Phe lie Ser 
610 615 620 

val Pro Leu Pro Ala Gly Leu Arg ser Gly Lys Ala Leu val Arg lie 
625 630 635 640 

Arg Gin Thr Gly Lys Asn Ser Gly Thr He Asp Asn Thr Gly Gly Thr 
645 650 655 

His Thr Ala Asp Leu Ser Arg Phe Pro lie Thr Ala Arg Thr Thr Ala 
660 665 670 

lie Lys Gly Arg Phe Glu Gly Ser Arg Phe Leu Pro Tyr His Thr Arg 
675 680 685 

Asn Gin lie Asn Gly Gly Ala Leu Asp Gly Lys Ala pro lie Leu Gly 
690 695 700 

Tyr Ala Glu Asp Pro val Glu Leu Phe Phe Met His lie Gin Gly Ser 
705 710 715 720 

Gly Arg Leu Lys Thr Pro Ser Gly Lys Tyr lie Arg lie Gly Tyr Ala 
725 730 735 

Asp Lys Asn Glu His Pro Tyr val Ser lie Gly Arg Tyr Met Ala Asp 
740 745 750 

Lys Gly Tyr Leu Lys Leu Gly Gin Thr Ser Met Gin Gly lie Lys Ala 
755 760 765 

Tyr Met Arg Gin Asn Pro Gin Arg Leu Ala Glu val Leu Gly Gin Asn 
770 " 775 780 

Pro ser Tyr lie Phe Phe Arg Glu Leu Ala Gly Ser ser Asn Asp Gly 
785 790 795 800 

Pro Val Gly Ala Leu Gly Thr Pro Leu Met Gly Glu Tyr Ala Gly Ala 
805 810 815 

val Asp Arg His Tyr lie Thr Leu Gly Ala Pro Leu Phe val Ala Thr 
820 825 830 

Ala His Pro Val Thr Arg Lys Ala Leu Asn Arg Leu lie Met Ala Gin 
835 840 845 

Asp Thr Gly Ser Ala lie Lys Gly Ala val Arg val Asp Tyr Phe Trp 
850 855 860 

Gly Tyr Gly Asp Glu Ala Gly Glu Leu Ala Gly Lys Gin Lys Thr Thr 
865 870 875 880 

Gly Tyr Val Trp Gin Leu Leu Pro Asn Gly Met Lys Pro Glu Tyr Arg 
885 890 895 

Pro 

<210> 96 
<211> 1941 
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Gly Glu ser 
115 

Asp Gly Met 
130 

Asn Thr Ala 
145 

Gly Ser Gin 

Gly Gly Asp 

Gly Pro Ser 
195 

Ser Gly Asn 
210 

Glu Lys Leu 

225 

Lys Asn Asp 

val Gin Met 

Pro Thr Ser 
275 

Leu Pro Ala 
290 

lie val Asp 
305 

Phe Ala Pro 

Leu Pro Gly 

Gly Glu Met 
355 

Phe His Thr 
370 

Ala Lys val 
385 

Gly Asp Gly 
Gly Asn Gly 



100 

Glu Gin Pro Ala 



Gin Gly Asp Asp 
135 

Ala Gin Gly Thr 
150 

Asn Pro Ala Ser 
165 

Phe Gly Arg Thr 
180 

Gin Asn lie Thr 



Asn Phe 

Ser Asp 

Gly Lys 
245 

Lys Gly 
260 

Phe Ala 
Glu Met 
Gly Glu 



Leu Asp 
215 

Ala Asp 
230 

Asn Asp 
lie Asn 
Arg Phe 



Pro Leu 
295 

Ala val 
310 



Glu Gly 
325 



Asn Tyr 
Tyr Ala 
Leu Ala Gly Thr 



Gly ser 
340 



Glu Asn 
Asp Phe 



Leu Hi s 
405 

Phe Lys 
420 



Ser Gly Lys 
435 

Ser Tyr Arg 
450 



Gly Arg 
375 

Gly Ser 
390 

Met Gly 
Gly Thr 
Gly Pro 



Phe Tyr 
Pro Thr 

Gly Lys Lys Glu Gin Asp Gly 



Asp Ala 
455 



105 

Asn Gin 
120 

Pro Ser 

Asn Gin 

Ser Thr 

Asn val 
185 

Leu Thr 
200 

Glu Glu 

Lys lie 

Lys Phe 

Gin Tyr 
265 

Arg Arg 
280 

lie Pro 

Ser Leu 

Arg Tyr 

Leu Arg 
345 

Ala val 
360 

Pro Ser 

Lys Ser 

Thr Gin 

Trp Thr 
425 

Ala Gly 
440 

Glu Lys 
Ser Gly 



Pro Asp 

Ala Gly 

Ala Glu 
155 

Asn Pro 
170 

Gly Asn 
His Cys 
val Gin 

Ser Asn 

235 

val Gly 
250 

lie lie 

Ser Ala 

val Asn 

Thr Gly 
315 

Leu Thr 

330 

val Gin 
Tyr Asn 
Pro Ser 



val Asp 
395 

Lys Phe 
410 



Met Ala 
125 

Gly Glu 
140 

Asn Asn 



Ser Ala 

ser val 

Lys Gly 
205 

Leu Lys 

220 

Tyr Lys 

Leu val 

Phe Tyr 

Arg Ser 
285 

Gin Ala 
300 

His Ser 

Tyr Gly 

Gly Glu 

Gly Glu 
365 

Arg Gly 
380 

Gly He 
Lys Ala 
Gly Gly 



Glu Asn 
Glu Glu 
Gly Gly 
Gly Gly Gly Ala 



val Ala 
445 

Phe Gly 
460 



110 

Asn Thr Ala 

Asn Ala Gly 

Gin Thr Ala 
160 

Thr Asn Ser 
175 

Val lie Asp 
190 

Asp Ser cys 

Ser Glu Phe 

Lys Asp Gly 
240 

Ala Asp Ser 
255 

Lys Pro Lys 
270 

Arg Arg Ser 

Asp Thr Leu 

Gly Asn lie 
320 

Ala Glu Lys 
335 

Pro ser Lys 
350 

val Leu His 

Arg Phe Ala 

lie Asp Ser 
400 

Ala lie Asp 
415 

Gly Asp val 
430 

Gly Lys Tyr 
Val Phe Ala 
Thr Tyr Lys 
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465 

val Asp Glu Tyr 



Thr ser 

Phe Asp 

Ala Asn 
530 

Ala Asp 
545 

Thr Lys 
Leu Thr 
Phe Asn 



Thr Asn 
500 

Gin Ala 
515 

Leu Gin 
lie Phe 
Phe Asn 



Met His 
580 

Cys Tyr 
595 



Asp Phe 
610 

val Asn 
625 



Ser Thr 
val Gly 
Ala Ala Lys Gin 



470 

His Ala 
485 

val Gly 

Lys Arg 

Ser Gly 

Asp Ala 
550 

Phe Asn 
565 

Gly Lys 
Gin Ser 
Thr He 



Asn Ala 

Gly Phe 

Asp Gly 
520 

Ser Gin 
535 

Ala Gin 
Gly Lys 
Thr Ala 



Arg Phe 
490 

Tyr Gly 
505 

Lys lie 
His Phe 
Tyr Pro 



Lys Leu 
570 

pro val 
585 



475 

Ala lie Asp His 



480 

Phe Asn 
495 



Met Thr 
630 



Ala Lys 
Thr Lys 
Lys Ser Val Arg 



Pro Met 
600 

Asp Arg 
61" 



Leu Thr 

Asp lie 

Thr Asp 
540 

Asp lie 
555 

val ser 
Lys Leu 
Thr Glu 



Trp Gly 
620 

lie Asp 
635 



Gly ser 
510 

Thr He 

525 



val Glu 
Pro val 
His Leu Lys Ser 



Arg Phe 

Val Asp 

Lys Ala 
590 

val Cys 
605 

val Asp 
He Gin 



val ser 
560 

Gly Asn 
575 

Glu Lys 
Gly Gly 
Tyr Leu 



He Glu 
640 



<210> 98 

<211> 2583 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



deltaG287NZ-961 



<400> 98 
atggctagcc 
tctgaaaaag 
gcgccatccg 
ggcggtgcgg 
ccgcaaaatg 
ccggccggaa 
caaccggata 
gaaaatgccg 
ggttctcaaa 
ggaaggacga 
acccactgta 
aaatcagaat 
aagaatgacg 
ggaatcaatc 
cgttctgcac 
gcggatacgc 
ttcgcgcccg 
tcgtatgccc 
gtgtacaacg 
ggcaggtttg 
ggcgatggtt 

aaggggactt 

ggcgaggaag 

ggcgtgtttg 

gacgatgtta 



ccgatgtcaa 
agacagaggc 
cacaaggcgg 
cagcaacgga 
ccgccgatac 
atatggaaaa 
tggcaaatac 
gcaatacggc 
atcctgcctc 
acgtgggcaa 
aaggcgattc 
ttgaaaaatt 
ggaagaatga 
aatatattat 
ggtcgaggcg 
tgattgtcga 
aagggaatta 
tccgtgttca 
gcgaagtgct 
ccgcaaaagt 
tgcatatggg 
ggacggaaaa 
tggcgggaaa 
ccggcaaaaa 
aaaaagctgc 



gtcggcggac 
aaaggaagat 
tcaagatatg 
caaacccaaa 
agatagtttg 
ccaagcaccg 
ggcggacgga 
tgcccaaggt 
ttcaaccaat 
ttctgttgtg 
ttgtagtggc 
aagtgatgca 
taaatttgtc 
cttttataaa 
gtcgcttccg 
tggggaagcg 
ccggtatctg 
aggcgaacct 
gcattttcat 
cgatttcggc 
tacgcaaaaa 
tggcggcggg 
atacagctat 
agagcaggat 
cactgtggcc 



acgctgtcaa 
gcgccacagg 
gcggcggttt 
aatgaagacg 
acaccgaatc 
gatgccgggg 
atgcagggtg 
acaaatcaag 
cctagcgcca 
attgacgggc 
aataatttct 
gacaaaataa 
ggtttggttg 
cctaaaccca 
gccgagatgc 
gtcagcctga 
acttacgggg 
tcaaaaggcg 
acggaaaacg 
agcaaatctg 
ttcaaagccg 
gatgtttccg 
cgcccaacag 
ggatccggag 
attgctgctg 



aacctgccgc 
caggttctca 
cggaagaaaa 
agggggcgca 
acaccccggc 
aatcggagca 
acgatccgtc 
ccgaaaacaa 
cgaatagcgg 
cgtcgcaaaa 
tggatgaaga 
gtaattacaa 
ccgatagtgt 
cttcatttgc 
cgctgattcc 
cggggcattc 
cggaaaaatt 
aaatgctcgc 
gccgtccgtc 
tggacggcat 
ccatcgatgg 
gaaagtttta 
atgcggaaaa 
gaggaggagc 
cctacaacaa 



ccctgttgtt 
aggacagggc 
tacaggcaat 
aaatgatatg 
ttcgaatatg 
gccggcaaac 
ggcaggcggg 
tcaaaccgcc 
tggtgatttt 
tataacgttg 
agtacagcta 
gaaagatggg 
gcagatgaag 
gcgatttagg 
cgtcaatcag 
cggcaatatc 
gcccggcgga 
gggcacggca 
cccgtccaga 
tatcgacagc 
aaacggcttt 
cggcccggcc 
gggcggattc 
cacaaacgac 
tggccaagaa 
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atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1560 

aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1620 

aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1680 

5 gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1740 

gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1800 

gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1860 

ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1920 

tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1980 

cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 2040 

gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 2100 

10 gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 2160 

aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2220 

attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2280 

tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2340 

aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2400 

aacgtgggtc ggttcaatgt aacggctgca gtcggcggct acaaatccga atcggcagtc 2460 

15 gccatcggta ccggcttccg ctttaccgaa aactttgccg ccaaagcagg cgtggcagtc 2520 

ggcacttcgt ccggttcttc cgcagcctac catgtcggcg tcaattacga gtggtaaaag 2580 

2583 





Ctt 














20 


<210> 
<211> 
<212> 
<213> 


99 

858 

PRT 

Arti f i ci al sequence 




<220> 
<223> 


deltaG287N2 


-961 




25 


<400> 
Met Ala 
1 


99 
Ser 


Pro 


Asp 
5 


val 


Lys 




Ala 


Pro 


V a 1 


val 

V a 1 

20 


Cor 

JCl 


Glu 


Lys 


30 


Gin 


Ala 


Gly 
35 


Ser 


Gin 


o i y 


\j I ii 




Asp 


Met 
50 


Ala 


Ala 


val 


Ser 


Glu 

55 


35 


Ala 

65 


Thr 


Asp 


Lys 


Pro 


Lys 
70 


Asn 




Pro 


Gin 


Asn 


Ala 


Ala 
85 


Asp 


Thr 


40 


Ala 


Ser 


Asn 


Met 
100 


Pro 


Ala 


Gly 




Gly 


Glu 


Ser 
115 


Glu 


Gin 


Pro 


Ala 


45 


Asp 


Gly 
130 


Met 


Gin 


Gly 


Asp 


Asp 
135 




Asn 
145 


Thr 


Ala 


Ala 


Gin 


Gly 
150 


Thr 


50 


Gly 


Ser 


Gin 


Asn 


Pro 
165 


Ala 


Ser 




Gly 


Gly 


Asp 


Phe 
180 


Gly 


Arg 


Thr 


55 


Gly 


Pro 


Ser 
195 


Gin 


Asn 


lie 


Thr 




Ser 


Gly 


Asn 


Asn 


Phe 


Leu 


Asp 



10 15 

Glu Thr Glu Ala Lys Glu Asp Ala Pro 
25 30 

Gly Ala Pro Ser Ala Gin Gly Gly Gin 
40 45 

Glu Asn Thr Gly Asn Gly Gly Ala Ala 
60 

Glu Asp Glu Gly Ala Gin Asn Asp Met 
75 80 

Asp Ser Leu Thr Pro Asn His Thr Pro 
90 95 

Asn Met Glu Asn Gin Ala Pro Asp Ala 
105 110 

Asn Gin Pro Asp Met Ala Asn Thr Ala 
120 125 

Pro Ser Ala Gly Gly Glu Asn Ala Gly 
140 

Asn Gin Ala Glu Asn Asn Gin Thr Ala 

155 160 

Ser Thr Asn Pro Ser Ala Thr Asn ser 
170 175 

Asn Val Gly Asn Ser val val lie Asp 
185 190 
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210 215 220 

Glu Lys Leu ser Asp Ala Asp Lys lie Ser Asn Tyr Lys Lys Asp Gly 
225 230 235 240 

Lys Asn Asp Gly Lys Asn Asp Lys Phe val Gly Leu val Ala Asp ser 
245 250 255 

val Gin Met Lys Gly lie Asn Gin Tyr lie lie Phe Tyr Lys Pro Lys 
260 265 270 

Pro Thr Ser Phe Ala Arg Phe Arg Arg Ser Ala Arg Ser Arg Arg Ser 
275 280 285 

Leu Pro Ala Glu Met Pro Leu lie Pro Val Asn Gin Ala Asp Thr Leu 
290 295 300 

lie val Asp Gly Glu Ala val ser Leu Thr Gly His ser Gly Asn lie 
305 310 315 320 

Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala Glu Lys 
325 330 335 

Leu Pro Gly Gly Ser Tyr Ala Leu Arg Val Gin Gly Glu Pro Ser Lys 
340 345 350 

Gly Glu Met Leu Ala Gly Thr Ala val Tyr Asn Gly Glu val Leu His 
355 360 365 

Phe His Thr Glu Asn Gly Arg Pro Ser Pro Ser Arg Gly Arg Phe Ala 
370 375 380 

Ala Lys val Asp Phe Gly Ser Lys Ser val Asp Gly lie lie Asp Ser 
385 390 395 400 

Gly Asp Gly Leu His Met Gly Thr Gin Lys Phe Lys Ala Ala lie Asp 
405 410 415 

Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Gly Gly Asp Val 
420 425 430 

Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly Lys Tyr 
435 440 445 

ser Tyr Arg Pro Thr Asp Ala Glu Lys Gly Gly Phe Gly Val Phe Ala 
450 455 460 

Gly Lys Lys Glu Gin Asp Gly Ser Gly Gly Gly Gly Ala Thr Asn Asp 
465 470 475 480 

Asp Asp val Lys Lys Ala Ala Thr val Ala lie Ala Ala Ala Tyr Asn 
485 490 495 

Asn Gly Gin Glu He Asn Gly Phe Lys Ala Gly Glu Thr lie Tyr Asp 
500 505 510 

lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp Ala Thr Ala Ala Asp 
515 520 525 

val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys Lys val val Thr 
530 535 540 

Asn Leu Thr Lys Thr val Asn Glu Asn Lys Gin Asn val Asp Ala Lys 
545 550 555 560 

val Lys Ala Ala Glu ser Glu lie Glu Lys Leu Thr Thr Lys Leu Ala 
565 570 575 

Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr 
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580 585 590 

Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie Thr Thr Phe Ala Glu 

595 600 605 

Glu Thr Lys Thr Asn lie Val Lys He Asp Glu Lys Leu Glu Ala Val 
610 615 620 

Ala Asp Thr val Asp Lys His Ala Glu Ala Phe Asn Asp lie Ala Asp 
625 630 635 640 

ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala val Lys Thr Ala 
645 650 655 

Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys Gin Asn Val Asp Ala 
660 665 670 

Lys val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala 
675 680 685 

Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala Val Ala Ala Lys 
690 695 700 

Val Thr Asp lie Lys Ala Asp lie Ala Thr Asn Lys Asp Asn lie Ala 
705 710 715 720 

Lys Lys Ala Asn ser Ala Asp val Tyr Thr Arg Glu Glu ser Asp ser 
725 730 735 

Lys Phe val Arg lie Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp 
740 745 750 

Thr Arg Leu Ala Ser Ala Glu Lys Ser lie Ala Asp His Asp Thr Arg 
755 760 765 

Leu Asn Gly Leu Asp Lys Thr val Ser Asp Leu Arg Lys Glu Thr Arg 
770 775 780 

Gin Gly Leu Ala Glu Gin Ala Ala Leu Ser Gly Leu Phe Gin Pro Tyr 
785 790 795 800 

Asn val Gly Arg Phe Asn val Thr Ala Ala val Gly Gly Tyr Lys ser 
805 810 815 

Glu Ser Ala val Ala lie Gly Thr Gly Phe Arg Phe Thr Glu Asn Phe 
820 825 830 

Ala Ala Lys Ala Gly Val Ala Val Gly Thr Ser Ser Gly Ser Ser Ala 
835 840 845 

Ala Tyr His Val Gly Val Asn Tyr Glu Trp 
850 855 

<210> 100 
<211> 4425 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> del taG983-0RF46. 1 
<400> 100 

atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 
acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 
agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 
aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 
aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 
ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 
aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 



154 



EP 1 790 660 A2 



cctgaagacg gaggcggtaa agacattgaa 
actgaagcaa agccgacgga tatccgccac 
tcccatatta ttggcgggcg ttccgtggac 

s gcgacgctac acataatgaa tacgaatgat 

atccgcaatg catgggtcaa gctgggcgaa 
ggaacaacat cgagggcagg cactgccgac 
taccgccaag cgttgctcga ctattccggc 
atgcaacaga gcgattacgg caacctgtcc 
atcttttcga caggcaatga cgcacaagct 
tatgaaaaag acgctcaaaa aggcattatc 

10 aagttcaaac gggaaatgta tggagaaccg 

cattgcggaa ttactgccat gtggtgcctg 
acccgtacaa acccgattca aattgccgga 
acggcggctc tgctgctgca gaaatacccg 
ttgctgacga cggctcagga catcggtgca 
ctgctggatg cgggtaaggc catgaacgga 

15 gccgatacga aaggtacatc cgatattgcc 

ggcggcctga tcaaaaaagg cggcagccaa 
ggcaaaacca ttatcgaagg cggttcgctg 
cgcgtcgaaa ccaaaggtgc gctgatttat 
agcgacggca ttgtctatct ggcagatacc 
atcaaaggca gtctgcagct ggacggcaaa 

20 ctgaaagtgg acggtacggc gattatcggc 

ggggcaggct atctcaacag taccggacga 
gggcaggatt attctttctt cacaaacatc 
gacagcgtcg aaaaaacagc gggcagtgaa 
ggcaatgcgg cacggactgc ttcggcagcg 
gccgtagaac agggcggcag caatctggaa 
tcatccgcaa cacccgagac ggttgaaact 

25 atccgcccct acggcgcaac tttccgcgca 

gacggtgtac gcatcttcaa cagtctcgcc 
catgccgata tgcagggacg ccgcctgaaa 
acgggtctgc gcgtcatcgc gcaaacccaa 
gttgaaggca aaatgcgcgg cagtacccaa 
aatacgacag cagccgccac actgggcatg 

30 aatgcaaaaa ccgacagcat tagtctgttt 

ggctatctca aaggcctgtt ctcctacgga 
ggtgcggacg aacatgcgga aggcagcgtc 
ggcggtgtca acgttccgtt tgccgcaacg 
tacgacctgc tcaaacagga tgcattcgcc 
aacagcctca ctgaaggcac gctggtcgga 

35 agcgataaag ccgtcctgtt tgcaacggcg 

tacacggtaa cgggcggctt taccggcgcg 
aatatgccgc acacccgtct ggttgccggc 
tggaacggct tggcacgtta cagctacgcc 
cgagtcggcg taggctaccg gttcctcgac 
gcaaacgatt cttttatccg gcaggttctc 
taccacctat tcggcagcag gggggaactt 

40 aaaatacaaa gccatcagtt gggcaacctg 

atcggctaca ttgtccgctt ttccgatcac 
catgcctcac attccgattc tgatgaagcc 
cgcatccatt gggacggata cgaacaccat 
ggcggctatc ccgctcccaa aggcgcgagg 
gcccaaaata tccgcctcaa cctgaccgac 

45 cgtttccaca atgccggtag tatgctgacg 

acccgataca gccccgagct ggacagatcg 
gcagatatcg ttaaaaacat catcggcgcg 
gtgcagggca taagcgaagg ctcaaacatt 
accgaaaaca agatggcgcg catcaacgat 
gccgcagcag ccatccgcga ttgggcagtc 

50 gccgtcagca atatctttat ggcagccatc 

aaatacggct tgggcggcat cacggcacat 
gcattgccga aagggaaatc cgccgtcagc 
tacccgtccc cttaccattc ccgaaatatc 
gaaaacatca cctcctcaac cgtgccgccg 
caacgccacc cgaagacagg cgtaccgttt 
cacgtgaaat atgatacgct cgagcaccac 

55 ~ ^ 

<210> 101 



gcttctttcg acgatgaggc cgttatagag 480 

gtaaaagaaa tcggacacat cgatttggtc 540 

ggcagacctg caggcggtat tgcgcccgat 600 

gaaaccaaga acgaaatgat ggttgcagcc 660 

cgtggcgtgc gcatcgtcaa taacagtttt 720 

cttttccaaa tagccaattc ggaggagcag 780 

ggtgataaaa cagacgaggg tatccgcctg 840 

taccacatcc gtaataaaaa catgcttttc 900 

cagcccaaca catatgccct attgccattt 960 

acagtcgcag gcgtagaccg cagtggagaa 1020 

ggtacagaac cgcttgagta tggctccaac 1080 

tcggcaccct atgaagcaag cgtccgtttc 1140 

acatcctttt ccgcacccat cgtaaccggc 1200 

tggatgagca acgacaacct gcgtaccacg 1260 

gtcggcgtgg acagcaagtt cggctgggga 1320 

cccgcgtcct ttccgttcgg cgactttacc 1380 

tactccttcc gtaacgacat ttcaggcacg 1440 

ctgcaactgc acggcaacaa cacctatacg 1500 

gtgttgtacg gcaacaacaa atcggatatg 1560 

aacggggcgg catccggcgg cagcctgaac 1620 

gaccaatccg gcgcaaacga aaccgtacac 1680 

ggtacgctgt acacacgttt gggcaaactg 1740 

ggcaagctgt acatgtcggc acgcggcaag 1800 

cgtgttccct tcctgagtgc cgccaaaatc 1860 

gaaaccgacg gcggcctgct ggcttccctc 1920 

ggcgacacgc tgtcctatta tgtccgtcgc 1980 

gcacattccg cgcccgccgg tctgaaacac 2040 

aacctgatgg tcgaactgga tgcctccgaa 2100 

gcggcagccg accgcacaga tatgccgggc 2160 

gcggcagccg tacagcatgc gaatgccgcc 2220 

gctaccgtct atgccgacag taccgccgcc 2280 

gccgtatcgg acgggttgga ccacaacggc 2340 

caggacggtg gaacgtggga acagggcggt 2400 

accgtcggca ttgccgcgaa aaccggcgaa 2460 

ggacgcagca catggagcga aaacagtgca 2 520 

gcaggcatac ggcacgatgc gggcgatatc 2580 

cgctacaaaa acagcatcag ccgcagcacc 2640 

aacggcacgc tgatgcagct gggcgcactg 2700 

ggagatttga cggtcgaagg cggtctgcgc 2760 

gaaaaaggca gtgctttggg ctggagcggc 2820 

ctcgcgggtc tgaagctgtc gcaacccttg 2880 

ggcgtggaac gcgacctgaa cggacgcgac 2940 

actgcagcaa ccggcaagac gggggcacgc 3000 

ctgggcgcgg atgtcgaatt cggcaacggc 3060 

ggttccaaac agtacggcaa ccacagcgga 3120 

ggtggcggag gcactggatc ctcagatttg 3180 

gaccgtcagc atttcgaacc cgacgggaaa 3240 

gccgagcgca gcggccatat cggattggga 3300 

atgattcaac aggcggccat taaaggaaat 3360 

gggcacgaag tccattcccc cttcgacaac 3420 

ggtagtcccg ttgacggatt tagcctttac 3480 

cccgccgacg gctatgacgg gccacagggc 3540 

gatatataca gctacgacat aaaaggcgtt 3600 

aaccgcagca ccggacaacg gcttgccgac 3660 

caaggagtag gcgacggatt caaacgcgcc 3720 

ggcaatgccg ccgaagcctt caacggcact 3780 

gcaggagaaa ttgtcggcgc aggcgatgcc 3840 

gctgtcatgc acggcttggg tctgctttcc 3900 

ttggcagata tggcgcaact caaagactat 3960 

caaaacccca atgccgcaca aggcatagaa 4020 

cccatcaaag ggattggagc tgttcgggga 4080 

cctatcaagc ggtcgcagat gggcgcgatc 4140 

gacaattttg ccgatgcggc atacgccaaa 4200 

cgttcaaact tggagcagcg ttacggcaaa 4260 

tcaaacggca aaaatgtcaa actggcagac 4320 

gacggtaaag ggtttccgaa ttttgagaag 4380 

caccaccacc actga 4425 
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<211> 1474 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> del taG983-ORF46 . 1 

<400> 101 

Met Thr ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly lie Gly Ser 
1 5 10 15 

Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala val Ser Tyr Ala Gly 
20 25 30 

lie Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 
35 40 45 

Asp Asp val Ala val Thr Asp Arg Asp Ala Lys lie Asn Ala Pro Pro 
50 55 60 

Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 
65 70 75 80 

Asn Leu lie Asn Leu Lys Pro Ala lie Glu Ala Gly Tyr Thr Gly Arg 
85 90 95 

Gly val Glu val Gly lie val Asp Thr Gly Glu ser val Gly Ser lie 
100 105 110 

Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 
115 120 125 

Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 
130 135 140 

Gly Gly Lys Asp lie Glu Ala Ser Phe Asp Asp Glu Ala val lie Glu 
145 150 155 160 

Thr Glu Ala Lys Pro Thr Asp lie Arg His Val Lys Glu lie Gly His 
165 170 175 

lie Asp Leu Val Ser His lie lie Gly Gly Arg Ser val Asp Gly Arg 
180 185 190 

Pro Ala Gly Gly lie Ala Pro Asp Ala Thr Leu His He Met Asn Thr 
195 200 205 

Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala lie Arg Asn Ala 
210 215 220 

Trp val Lys Leu Gly Glu Arg Gly val Arg lie val Asn Asn ser Phe 
225 230 235 240 

Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gin lie Ala Asn 
245 250 255 

Ser Glu Glu Gin Tyr Arg Gin Ala Leu Leu Asp Tyr Ser Gly Gly Asp 
260 265 270 

Lys Thr Asp Glu Gly lie Arg Leu Met Gin Gin ser Asp Tyr Gly Asn 
275 280 285 

Leu Ser Tyr His lie Arg Asn Lys Asn Met Leu Phe lie Phe Ser Thr 
290 295 300 

Gly Asn Asp Ala Gin Ala Gin Pro Asn Thr Tyr Ala Leu Leu Pro Phe 
305 310 315 320 

Tyr Glu Lys Asp Ala Gin Lys Gly lie He Thr Val Ala Gly Val Asp 
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325 330 335 

Arg Ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 
340 345 350 

Glu Pro Leu Glu Tyr Gly ser Asn His cys Gly lie Thr Ala Met Trp 
355 360 365 

Cys Leu Ser Ala Pro Tyr Glu Ala ser val Arg phe Thr Arg Thr Asn 
370 375 380 

Pro He Gin lie Ala Gly Thr Ser Phe Ser Ala Pro lie val Thr Gly 
385 390 395 400 

Thr Ala Ala Leu Leu Leu Gin Lys Tyr Pro Trp Met Ser Asn Asp Asn 
405 410 415 

Leu Arg Thr Thr Leu Leu Thr Thr Ala Gin Asp lie Gly Ala Val Gly 
420 425 430 

Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 
435 440 445 

Asn Gly Pro Ala Ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 
450 455 460 

Gly Thr Ser Asp lie Ala Tyr Ser Phe Arg Asn Asp lie Ser Gly Thr 
465 470 475 480 

Gly Gly Leu lie Lys Lys Gly Gly Ser Gin Leu Gin Leu His Gly Asn 
485 490 495 

Asn Thr Tyr Thr Gly Lys Thr lie lie Glu Gly Gly Ser Leu Val Leu 
500 505 510 

Tyr Gly Asn Asn Lys Ser Asp Met Arg val Glu Thr Lys Gly Ala Leu 
515 520 525 

lie Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly lie 
530 535 540 

val Tyr Leu Ala Asp Thr Asp Gin Ser Gly Ala Asn Glu Thr val His 
545 550 555 560 

lie Lys Gly Ser Leu Gin Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg 
565 570 575 

Leu Gly Lys Leu Leu Lys Val Asp Gly Thr Ala lie lie Gly Gly Lys 
580 585 590 

Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn Ser Thr 
595 600 605 

Gly Arg Arg val Pro Phe Leu Ser Ala Ala Lys lie Gly Gin Asp Tyr 
610 615 620 

Ser Phe Phe Thr Asn lie Glu Thr Asp Gly Gly Leu Leu Ala Ser Leu 
625 630 635 640 

Asp ser val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr Leu Ser Tyr 
645 650 655 

Tyr val Arg Arg Gly Asn Ala Ala Arg Thr Ala ser Ala Ala Ala His 
660 665 670 

ser Ala pro Ala Gly Leu Lys His Ala val Glu Gin Gly Gly ser Asn 
675 680 685 

Leu Glu Asn Leu Met val Glu Leu Asp Ala ser Glu ser ser Ala Thr 
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690 695 700 

Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 
705 710 715 720 

lie Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala val Gin His 

725 730 735 

Ala Asn Ala Ala Asp Gly val Arg lie Phe Asn Ser Leu Ala Ala Thr 
740 745 750 

val Tyr Ala Asp ser Thr Ala Ala His Ala Asp Met Gin Gly Arg Arg 
755 760 765 

Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 
770 775 780 

Val lie Ala Gin Thr Gin Gin Asp Gly Gly Thr Trp Glu Gin Gly Gly 
785 790 795 800 

val Glu Gly Lys Met Arg Gly Ser Thr Gin Thr Val Gly lie Ala Ala 
805 810 815 

Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 
820 825 830 

Ser Thr Trp ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser He Ser 
835 840 845 

Leu Phe Ala Gly lie Arg His Asp Ala Gly Asp lie Gly Tyr Leu Lys 
850 855 860 

Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser lie Ser Arg Ser Thr 
865 870 875 880 

Gly Ala Asp Glu His Ala Glu Gly Ser val Asn Gly Thr Leu Met Gin 
885 890 895 

Leu Gly Ala Leu Gly Gly Val Asn val Pro Phe Ala Ala Thr Gly Asp 
900 905 910 

Leu Thr val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gin Asp Ala 
915 920 925 

Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 
930 935 940 

Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser Gin Pro Leu 
945 950 955 960 

Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly val Glu Arg Asp Leu 
965 970 975 

Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 
980 985 990 

Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu val 
995 1000 1005 

Ala Gly Leu Gly Ala Asp val Glu Phe Gly Asn Gly Trp Asn Gly Leu 
1010 1015 1020 

Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn His Ser Gly 
1025 1030 1035 1040 

Arg Val Gly Val Gly Tyr Arg Phe Leu Asp Gly Gly Gly Gly Thr Gly 
1045 1050 1055 

ser ser Asp Leu Ala Asn Asp Ser Phe He Arg Gin val Leu Asp Arg 
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1060 1065 1070 

Gin His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly Ser Arg Gly 
1075 1080 1085 

Glu Leu Ala Glu Arg Ser Gly His lie Gly Leu Gly Lys He Gin Ser 
1090 1095 1100 

His Gin Leu Gly Asn Leu Met lie Gin Gin Ala Ala lie Lys Gly Asn 
1105 1110 1115 1120 

lie Gly Tyr lie Val Arg Phe Ser Asp His Gly His Glu val His Ser 
112 5 1130 1135 

Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser 
1140 1145 1150 

Pro val Asp Gly Phe Ser Leu Tyr Arg lie His Trp Asp Gly Tyr Glu 
1155 1160 1165 

His His Pro Ala Asp Gly Tyr Asp Gly Pro Gin Gly Gly Gly Tyr Pro 
1170 1175 1180 

Ala Pro Lys Gly Ala Arg Asp lie Tyr Ser Tyr Asp lie Lys Gly Val 
1185 1190 1195 1200 

Ala Gin Asn lie Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gin 
1205 1210 1215 

Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gin Gly 
1220 1225 1230 

val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu Asp 
1235 1240 1245 

Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp lie Val 
1250 1255 1260 

Lys Asn lie lie Gly Ala Ala Gly Glu lie Val Gly Ala Gly Asp Ala 
1265 1270 1275 1280 

Val Gin Gly lie Ser Glu Gly Ser Asn lie Ala val Met His Gly Leu 
1285 1290 1295 

Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg lie Asn Asp Leu Ala 
1300 1305 1310 

Asp Met Ala Gin Leu Lys Asp Tyr Ala Ala Ala Ala lie Arg Asp Trp 
1315 1320 1325 

Ala val Gin Asn Pro Asn Ala Ala Gin Gly lie Glu Ala val ser Asn 
1330 1335 1340 

lie Phe Met Ala Ala lie Pro lie Lys Gly lie Gly Ala val Arg Gly 
1345 1350 1355 1360 

Lys Tyr Gly Leu Gly Gly lie Thr Ala His Pro lie Lys Arg ser Gin 
1365 1370 1375 

Met Gly Ala lie Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp Asn 
1380 1385 1390 

Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg 
1395 1400 1405 

Asn He Arg Ser Asn Leu Glu Gin Arg Tyr Gly Lys Glu Asn lie Thr 
1410 1415 1420 

Ser ser Thr Val Pro Pro Ser Asn Gly Lys Asn val Lys Leu Ala Asp 
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1435 



1440 



Gin Arg His Pro Lys Thr Gly val Pro Phe Asp Gly Lys Gly Phe Pro 
1445 1450 1455 

Asn Phe Glu Lys His val Lys Tyr Asp Thr Leu Glu His His His His 
1460 1465 1470 

His His 



10 



15 



<210> 102 

<211> 3939 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



deltaG983-741 



20 



25 



30 



35 



40 



45 



50 



55 



<400> 102 

atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60 

acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120 

agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180 

aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240 

aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300 

ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360 

aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420 

cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480 

actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540 

tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600 

gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660 

atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720 

ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780 

taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840 

atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900 

atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960 

tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020 

aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080 

cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140 

acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200 

acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260 

ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320 

ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380 

gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440 

ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500 

ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560 

cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620 

agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680 

atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740 

ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800 

ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860 

gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920 

gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980 

ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040 

gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100 

tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160 

atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220 

gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280 

catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340 

acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400 

gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460 

aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520 

aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac ggcacgatgc gggcgatatc 2580 

ggctatctca aaggcctgtt ctcctacgga cgctacaaaa acagcatcag ccgcagcacc 2640 

ggtgcggacg aacatgcgga aggcagcgtc aacggcacgc tgatgcagct gggcgcactg 2700 

ggcggtgtca acgttccgtt tgccgcaacg ggagatttga cggtcgaagg cggtctgcgc 2760 

tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca gtgctttggg ctggagcggc 2820 

aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc tgaagctgtc gcaacccttg 2880 

agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac gcgacctgaa cggacgcgac 2940 
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tacacggtaa cgggcggctt taccggcgcg actgcagcaa ccggcaagac gggggcacgc 

aatatgccgc acacccgtct ggttgccggc ctgggcgcgg atgtcgaatt cggcaacggc 

tggaacggct tggcacgtta cagctacgcc ggttccaaac agtacggcaa ccacagcgga 

cgagtcggcg taggctaccg gttcctcgag ggatccggag ggggtggtgt cgccgccgac 

atcggtgcgg ggcttgccga tgcactaacc gcaccgctcg accataaaga caaaggtttg 

cagtctttga cgctggatca gtccgtcagg aaaaacgaga aactgaagct ggcggcacaa 

ggtgcggaaa aaacttatgg aaacggtgac agcctcaata cgggcaaatt gaagaacgac 

aaggtcagcc gtttcgactt tatccgccaa atcgaagtgg acgggcagct cattaccttg 

gagagtggag agttccaagt atacaaacaa agccattccg ccttaaccgc ctttcagacc 

gagcaaatac aagattcgga gcattccggg aagatggttg cgaaacgcca gttcagaatc 

ggcgacatag cgggcgaaca tacatctttt gacaagcttc ccgaaggcgg cagggcgaca 

tatcgcggga cggcgttcgg ttcagacgat gccggcggaa aactgaccta caccatagat 

ttcgccgcca agcagggaaa cggcaaaatc gaacatttga aatcgccaga actcaatgtc 

gacctggccg ccgccgatat caagccggat ggaaaacgcc atgccgtcat cagcggttcc 

gtcctttaca accaagccga gaaaggcagt tactccctcg gtatctttgg cggaaaagcc 

caggaagttg ccggcagcgc ggaagtgaaa accgtaaacg gcatacgcca tatcggcctt 

gccgccaagc aactcgagca ccaccaccac caccactga 

<210> 103 

<211> 1312 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> deltaG983-741 

<400> 103 

Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly lie Gly Ser 
1 5 10 15 

Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 
20 25 30 

lie Lys Asn Glu Met Cys Lys Asp Arg Ser Met Leu Cys Ala Gly Arg 
35 40 45 

Asp Asp val Ala val Thr Asp Arg Asp Ala Lys He Asn Ala Pro Pro 
50 55 60 

Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 
65 70 75 80 

Asn Leu lie Asn Leu Lys Pro Ala lie Glu Ala Gly Tyr Thr Gly Arg 
85 90 95 

Gly val Glu val Gly lie val Asp Thr Gly Glu Ser val Gly Ser lie 
100 105 110 

Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 
115 120 125 

Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 
130 * 135 ~ 140 

Gly Gly Lys Asp lie Glu Ala Ser Phe Asp Asp Glu Ala Val lie Glu 
145 150 155 160 

Thr Glu Ala Lys Pro Thr Asp lie Arg His val Lys Glu lie Gly His 
165 170 175 

lie Asp Leu Val Ser His lie lie Gly Gly Arg Ser Val Asp Gly Arg 
180 185 190 

Pro Ala Gly Gly lie Ala Pro Asp Ala Thr Leu His lie Met Asn Thr 
195 200 205 

Asn Asp Glu Thr Lys Asn Glu Met Met Val Ala Ala lie Arg Asn Ala 
210 215 220 
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Trp val Lys 

225 

Gly Thr Thr 

Ser Glu Glu 

Lys Thr Asp 
275 

Leu Ser Tyr 
290 

Gly Asn Asp 
305 

Tyr Glu Lys 
Arg Ser Gly 

Glu Pro Leu 

355 

cys Leu ser 
370 

Pro lie Gin 
385 

Thr Ala Ala 

Leu Arg Thr 

val Asp ser 
435 

Asn Gly pro 
450 

Gly Thr Ser 
465 

Gly Gly Leu 

Asn Thr Tyr 

Tyr Gly Asn 
515 

lie Tyr Asn 
530 

val Tyr Leu 
545 

lie Lys Gly 
Leu Gly Lys 



Leu Gly 

Ser Arg 
245 

Gin Tyr 
260 

Glu Gly 

His lie 

Ala Gin 

Asp Ala 
325 

Glu Lys 
340 

Glu Tyr 

Ala Pro 

lie Ala 

Leu Leu 
405 

Thr Leu 
420 

Lys Phe 

Ala ser 

Asp lie 

lie Lys 
485 

Thr Gly 
500 

Asn Lys 
Gly Ala 
Ala Asp 



Ser Leu 
565 

Leu Leu 
580 



Glu Arg 
230 

Ala Gly 

Arg Gin 

lie Arg 

Arg Asn 
295 

Ala Gin 

310 

Gin Lys 
Phe Lys 
Gly Ser 

Tyr Glu 

375 

Gly Thr 
390 

Leu Gin 
Leu Thr 
Gly Trp 

Phe Pro 

455 

Ala Tyr 
470 

Lys Gly 

Lys Thr 

Ser Asp 

Ala Ser 
535 

Thr Asp 
550 

Gin Leu 
Lys val 



Gly val 

Thr Ala 

Ala Leu 
265 

Leu Met 
280 

Lys Asn 

Pro Asn 

Gly lie 

Arg Glu 
345 

Asn His 
360 

Ala Ser 

ser Phe 

Lys Tyr 

Thr Ala 
425 

Gly Leu 
440 

Phe Gly 

Ser Phe 

Gly Ser 

He He 
505 

Met Arg 
520 

Gly Gly 
Gin Ser 
Asp Gly 



Asp Gly 
585 



Arg lie val 
235 

Asp Leu Phe 
250 

Leu Asp Tyr 

Gin Gin Ser 

Met Leu Phe 
300 

Thr Tyr Ala 
315 

lie Thr val 
330 

Met Tyr Gly 

Cys Gly He 

val Arg Phe 
380 

ser Ala Pro 

395 

Pro Trp Met 
410 

Gin Asp lie 

Leu Asp Ala 

Asp Phe Thr 
460 

Arg Asn Asp 
475 

Gin Leu Gin 
490 

Glu Gly Gly 
Val Glu Thr 



Ser Leu Asn 
540 

Gly Ala Asn 
555 

Lys Gly Thr 
570 

Thr Ala lie 



Asn Asn ser Phe 
240 

Gin lie Ala Asn 

255 

ser Gly Gly Asp 
270 

Asp Tyr Gly Asn 
285 

lie Phe ser Thr 



Leu Leu 
Ala Gly 

Glu Pro 

350 

Thr Ala 
365 

Thr Arg 

lie val 

Ser Asn 

Gly Ala 
430 

Gly Lys 
445 

Ala Asp 

lie Ser 

Leu His 

Ser Leu 
510 

Lys Gly 
52 5 

Ser Asp 
Glu Thr 
Leu Tyr 



lie Gly 
590 



Pro Phe 

320 

val Asp 
335 

Gly Thr 

Met Trp 

Thr Asn 

Thr Gly 
400 

Asp Asn 
415 

val Gly 

Ala Met 

Thr Lys 

Gly Thr 
480 

Gly Asn 
495 

Val Leu 
Ala Leu 
Gly He 



Val His 
560 

Thr Arg 
575 

Gly Lys 



162 



EP 1 790 660 A2 



Leu Tyr 

Gly Arg 
610 

Ser Phe 

625 

Asp Ser 

Tyr Val 

Ser Ala 

Leu Glu 
690 

Pro Glu 
705 

lie Arg 

Ala Asn 

Val Tyr 

Leu Lys 
770 

val He 
785 

Val Glu 

Lys Thr 

Ser Thr 

Leu Phe 
850 

Gly Leu 
865 

Gly Ala 
Leu Gly 
Leu Thr 



Phe Ala 
930 

Glu Gly 
945 



Met Ser 
595 

Arg val 
Phe Thr 
val Glu 



Ala Arg 
Pro Phe 



Asn lie 
630 

Lys Thr 
645 



Gly Asn 
Gly Leu 
Asn Leu Met Val 



Arg Arg 
660 

Pro Ala 
675 



Thr val 

Pro Tyr 

Ala Ala 
740 

Ala Asp 
755 

Ala val 
Ala Gin 

Gly Lys 

Gly Glu 
820 

Trp Ser 
835 

Ala Gly 

Phe Ser 

Asp Glu 

Ala Leu 
900 

val Glu 
915 

Glu Lys 
Thr Leu 



Glu Thr 
710 

Gly Ala 
725 

Asp Gly 

Ser Thr 

ser Asp 

Thr Gin 
790 

Met Arg 
805 

Asn Thr 

Glu Asn 

lie Arg 

Tyr Gly 
870 

His Ala 
885 

Gly Gly 
Gly Gly 
Gly Ser 



Val Gly 
950 



Gly Lys 
600 

Leu Ser 
615 

Glu Thr 

Ala Gly 

Ala Ala 

Lys His 
680 

Glu Leu 

695 

Ala Ala 

Thr Phe 

Val Arg 

Ala Ala 
760 

Gly Leu 
775 

Gin Asp 

Gly Ser 

Thr Ala 

Ser Ala 
840 

His Asp 

855 

Arg Tyr 
Glu Gly 
val Asn 



Gly Ala Gly Tyr 



Ala Ala Lys lie 
620 

Asp Gly Gly Leu 
635 

Ser Glu Gly Asp 
650 

Arg Thr Ala Ser 
665 

Ala val Glu Gin 



Leu Asn 
605 

Gly Gin 
Leu Ala 
Thr Leu 



ser Thr 
Asp Tyr 



Ser Leu 
640 

Ser Tyr 
655 



Ala Ala 
670 

Gly Gly 
685 



Asp Ala 

Ala Asp 

Arg Ala 
730 

lie Phe 

745 

His Ala 

ASP His 

Gly Gly 

Thr Gin 
810 

Ala Ala 

825 

Asn Ala 

Ala Gly 

Lys Asn 

ser val 
890 

Val Pro 
905 



Ser Glu 
700 



Ala His 
Ser Asn 
Ser Ser Ala Thr 



Arg Thr 
715 



Asp Met 
Ala Ala Ala val 



Asn Ser 

Asp Met 

Asn Gly 
780 

Thr Trp 

795 

Thr val 

Thr Leu 

Lys Thr 

Asp lie 
860 

Ser lie 
875 

Asn Gly 
Phe Ala 



Leu Ala 
750 

Gin Gly 
765 

Thr Gly 

Glu Gin 

Gly He 

Gly Met 
830 

Asp Ser 
845 

Gly Tyr 
Ser Arg 
Thr Leu 



Pro Gly 
720 

Gin His 

735 

Ala Thr 

Arg Arg 

Leu Arg 

Gly Gly 
800 

Ala Ala 
815 

Gly Arg 
lie Ser 
Leu Lys 



Ser Thr 
880 

Met Gin 
895 



Ala Thr 
910 



Leu Arg 
920 



Tyr Asp Leu Leu 
Gly Trp 
Leu Ala Gly Leu 



Ala Leu 
935 



Ser Gly 
940 



Gly Asp 
Asp Ala 
Asn Ser Leu Thr 



Lys Gin 
925 



Lys Leu 
955 



Ser Gin Pro Leu 
960 
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ser Asp Lys Ala val Leu Phe Ala Thr Ala Gly val Glu Arg Asp Leu 
965 970 975 

Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 
980 985 990 

Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu val 
995 1000 1005 

Ala Gly Leu Gly Ala Asp val Glu Phe Gly Asn Gly Trp Asn Gly Leu 
1010 1015 1020 

Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn His Ser Gly 
1025 1030 1035 1040 

Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Ser Gly Gly Gly Gly 
1045 1050 1055 

Val Ala Ala Asp lie Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro 
1060 1065 1070 

Leu Asp His Lys Asp Lys Gly Leu Gin Ser Leu Thr Leu Asp Gin Ser 
1075 1080 1085 

val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gin Gly Ala Glu Lys 
1090 1095 1100 

Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp 
1105 1110 1115 1120 

Lys Val Ser Arg Phe Asp Phe lie Arg Gin lie Glu Val Asp Gly Gin 
1125 1130 1135 

Leu lie Thr Leu Glu Ser Gly Glu Phe Gin val Tyr Lys Gin Ser His 
1140 1145 1150 

Ser Ala Leu Thr Ala Phe Gin Thr Glu Gin lie Gin Asp Ser Glu His 
1155 1160 1165 

Ser Gly Lys Met val Ala Lys Arg Gin Phe Arg lie Gly Asp lie Ala 
1170 1175 1180 

Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr 
1185 1190 1195 ~ 1200 

Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr 
1205 1210 1215 

Tyr Thr lie Asp Phe Ala Ala Lys Gin Gly Asn Gly Lys lie Glu His 
1220 1225 1230 

Leu Lys Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp lie Lys 
1235 1240 1245 

Pro Asp Gly Lys Arg His Ala val lie Ser Gly Ser Val Leu Tyr Asn 
1250 1255 1260 

Gin Ala Glu Lys Gly Ser Tyr Ser Leu Gly lie Phe Gly Gly Lys Ala 
1265 1270 1275 1280 

Gin Glu Val Ala Gly Ser Ala Glu Val Lys Thr Val Asn Gly lie Arg 
1285 1290 1295 

His lie Gly Leu Ala Ala Lys Gin Leu Glu His His His His His His 
1300 1305 1310 

<210> 104 
<211> 4344 
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<212> DNA 

<213> Artificial sequence 



104 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



<220> 
<223> 

<400> 
atgacttctg 
acaacagcga 
agaagcatgc 
aatgcccccc 
aatttgatca 
ggtatcgtcg 
aaagaacacg 
cctgaagacg 
actgaagcaa 
tcccatatta 
gcgacgctac 
atccgcaatg 
ggaacaacat 
taccgccaag 
atgcaacaga 
atcttttcga 
tatgaaaaag 
aagttcaaac 
cattgcggaa 
acccgtacaa 
acggcggctc 
ttgctgacga 
ctgctggatg 
gccgatacga 
ggcggcctga 
ggcaaaacca 
cgcgtcgaaa 
agcgacggca 
atcaaaggca 
ctgaaagtgg 
ggggcaggct 
gggcaggatt 
gacagcgtcg 
ggcaatgcgg 
gccgtagaac 
tcatccgcaa 
atccgcccct 
gacggtgtac 
catgccgata 
acgggtctgc 
gttgaaggca 
aatacgacag 
aatgcaaaaa 
ggctatctca 
ggtgcggacg 
ggcggtgtca 
tacgacctgc 
aacagcctca 
agcgataaag 
tacacggtaa 
aatatgccgc 
tggaacggct 
cgagtcggcg 
gacgacgatg 
gaaatcaacg 
accaaaaaag 
aaaaaagtcg 
aaagtaaaag 
gccgctttag 
ggagaaaata 
aaattagaag 
gattcattgg 
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cgcccgactt 
aatcagcagc 
tctgtgccgg 
ccccgaatct 
acctcaaacc 
acacaggcga 
gctataacga 
gaggcggtaa 
agccgacgga 
ttggcgggcg 
acataatgaa 
catgggtcaa 
cgagggcagg 
cgttgctcga 
gcgattacgg 
caggcaatga 
acgctcaaaa 
gggaaatgta 
ttactgccat 
acccgattca 
tgctgctgca 
cggctcagga 
cgggtaaggc 
aaggtacatc 
tcaaaaaagg 
ttatcgaagg 
ccaaaggtgc 
ttgtctatct 
gtctgcagct 
acggtacggc 
atctcaacag 
attctttctt 
aaaaaacagc 
cacggactgc 
agggcggcag 
cacccgagac 
acggcgcaac 
gcatcttcaa 
tgcagggacg 
gcgtcatcgc 
aaatgcgcgg 
cagccgccac 
ccgacagcat 
aaggcctgtt 
aacatgcgga 
acgttccgtt 
tcaaacagga 
ctgaaggcac 
ccgtcctgtt 
cgggcggctt 
acacccgtct 
tggcacgtta 
taggctaccg 
ttaaaaaagc 
gtttcaaagc 
acgcaactgc 
tgactaacct 
ctgcagaatc 
cagatactga 
taacgacatt 
ccgtggctga 
atgaaaccaa 



caatgcaggc 
agtatcttac 
tcgggatgac 
gcataccgga 
tgcaattgaa 
atccgtcggc 
aaattacaaa 
agacattgaa 
tatccgccac 
ttccgtggac 
tacgaatgat 
gctgggcgaa 
cactgccgac 
ctattccggc 
caacctgtcc 
cgcacaagct 
aggcattatc 
tggagaaccg 
gtggtgcctg 
aattgccgga 
gaaatacccg 
catcggtgca 
catgaacgga 
cgatattgcc 
cggcagccaa 
cggttcgctg 
gctgatttat 
ggcagatacc 
ggacggcaaa 
gattatcggc 
taccggacga 
cacaaacatc 
gggcagtgaa 
ttcggcagcg 
caatctggaa 
ggttgaaact 
tttccgcgca 
cagtctcgcc 
ccgcctgaaa 
gcaaacccaa 
cagtacccaa 
actgggcatg 
tagtctgttt 
ctcctacgga 
aggcagcgtc 
tgccgcaacg 
tgcattcgcc 
gctggtcgga 
tgcaacggcg 
taccggcgcg 
ggttgccggc 
cagctacgcc 
gttcctcgag 
tgccactgtg 
tggagagacc 
agccgatgtt 
gaccaaaacc 
tgaaatagaa 
tgccgctctg 
tgctgaagag 
taccgtcgac 
cactaaggca 



ggtaccggta 
gccggtatca 
gttgcggtta 
gactttccaa 
gcaggctata 
agcatatcct 
aactatacgg 
gcttctttcg 
gtaaaagaaa 
ggcagacctg 
gaaaccaaga 
cgtggcgtgc 
cttttccaaa 
ggtgataaaa 
taccacatcc 
cagcccaaca 
acagtcgcag 
ggtacagaac 
tcggcaccct 
acatcctttt 
tggatgagca 
gtcggcgtgg 
cccgcgtcct 
tactccttcc 
ctgcaactgc 
gtgttgtacg 
aacggggcgg 
gaccaatccg 
ggtacgctgt 
ggcaagctgt 
cgtgttccct 
gaaaccgacg 
ggcgacacgc 
gcacattccg 
aacctgatgg 
gcggcagccg 
gcggcagccg 
gctaccgtct 
gccgtatcgg 
caggacggtg 
accgtcggca 
ggacgcagca 
gcaggcatac 
cgctacaaaa 
aacggcacgc 
ggagatttga 
gaaaaaggca 
ctcgcgggtc 
ggcgtggaac 
actgcagcaa 
ctgggcgcgg 
ggttccaaac 
ggtggcggag 
gccattgctg 
atctacgaca 
gaagccgacg 
gtcaatgaaa 
aagttaacaa 
gatgcaacca 
actaagacaa 
aagcatgccg 
gacgaagccg 



tcggcagcaa 
agaacgaaat 
cagacaggga 
acccaaatga 
caggacgcgg 
ttcccgaact 
cgtatatgcg 
acgatgaggc 
tcggacacat 
caggcggtat 
acgaaatgat 
gcatcgtcaa 
tagccaattc 
cagacgaggg 
gtaataaaaa 
catatgccct 
gcgtagaccg 
cgcttgagta 
atgaagcaag 
ccgcacccat 
acgacaacct 
acagcaagtt 
ttccgttcgg 
gtaacgacat 
acggcaacaa 
gcaacaacaa 
catccggcgg 
gcgcaaacga 
acacacgttt 
acatgtcggc 
tcctgagtgc 
gcggcctgct 
tgtcctatta 
cgcccgccgg 
tcgaactgga 
accgcacaga 
tacagcatgc 
atgccgacag 
acgggttgga 
gaacgtggga 
ttgccgcgaa 
catggagcga 
ggcacgatgc 
acagcatcag 
tgatgcagct 
cggtcgaagg 
gtgctttggg 
tgaagctgtc 
gcgacctgaa 
ccggcaagac 
atgtcgaatt 
agtacggcaa 
gcactggatc 
ctgcctacaa 
ttgatgaaga 
actttaaagg 
acaaacaaaa 
ccaagttagc 
ccaacgcctt 
atatcgtaaa 
aagcattcaa 
tcaaaaccgc 



cagcagagca 
gtgcaaagac 
tgccaaaatc 
cgcatacaag 
ggtagaggta 
gtatggcaga 
gaaggaagcg 
cgttatagag 
cgatttggtc 
tgcgcccgat 
ggttgcagcc 
taacagtttt 
ggaggagcag 
tatccgcctg 
catgcttttc 
attgccattt 
cagtggagaa 
tggctccaac 
cgtccgtttc 
cgtaaccggc 
gcgtaccacg 
cggctgggga 
cgactttacc 
ttcaggcacg 
cacctatacg 
atcggatatg 
cagcctgaac 
aaccgtacac 
gggcaaactg 
acgcggcaag 
cgccaaaatc 
ggcttccctc 
tgtccgtcgc 
tctgaaacac 
tgcctccgaa 
tatgccgggc 
gaatgccgcc 
taccgccgcc 
ccacaacggc 
acagggcggt 
aaccggcgaa 
aaacagtgca 
gggcgatatc 
ccgcagcacc 
gggcgcactg 
cggtctgcgc 
ctggagcggc 
gcaacccttg 
cggacgcgac 
gggggcacgc 
cggcaacggc 
ccacagcgga 
cgccacaaac 
caatggccaa 
cggcacaatt 
tctgggtctg 
cgtcgatgcc 
agacactgat 
gaataaattg 
aattgatgaa 
cgatatcgcc 
caatgaagcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 

3420 

3480 

3540 

3600 

3660 

3720 
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aaacagacgg ccgaagaaac caaacaaaac gtcgatgcca aagtaaaagc tgcagaaact 3780 

gcagcaggca aagccgaagc tgccgctggc acagctaata ctgcagccga caaggccgaa 3840 

gctgtcgctg caaaagttac cgacatcaaa gctgatatcg ctacgaacaa agataatatt 3900 

gctaaaaaag caaacagtgc cgacgtgtac accagagaag agtctgacag caaatttgtc 3960 

agaattgatg gtctgaacgc tactaccgaa aaattggaca cacgcttggc ttctgctgaa 4020 

aaatccattg ccgatcacga tactcgcctg aacggtttgg ataaaacagt gtcagacctg 4080 

cgcaaagaaa cccgccaagg ccttgcagaa caagccgcgc tctccggtct gttccaacct 4140 

tacaacgtgg gtcggttcaa tgtaacggct gcagtcggcg gctacaaatc cgaatcggca 4200 

gtcgccatcg gtaccggctt ccgctttacc gaaaactttg ccgccaaagc aggcgtggca 4260 

gtcggcactt cgtccggttc ttccgcagcc taccatgtcg gcgtcaatta cgagtggctc 4320 

10 gagcaccacc accaccacca ctga 4344 



15 


<210> 
<211> 
<212> 
<213> 


105 

1447 

PRT 

Artificial Sequence 


















<220> 
<223> 


deltaG983-961 




















20 


<400> 
Met Thr 
1 


105 
Ser 


Ala 


Pro 

5 


ASp 


Phe 


Asn 


Ala Gly 
10 


Gly 


Thr 


Gly 


He 


Gly 
15 


Ser 




Asn 


Ser 


Arg 


Ala 
20 


Thr 


Thr 


Ala 


Lys 


Ser Ala 
25 


Ala 


val 


ser 


Tyr 
30 


Ala 


Gly 


25 


lie 


Lys 


Asn 
35 


Glu 


Met 


cys 


Lys 


ASp 
40 


Arg ser 


Met 


Leu 


Cys 
45 


Ala 


Gly 


Arg 




Asp 


Asp 
50 


val 


Ala 


val 


Thr 


ASp 

55 


Arg 


Asp Ala 


Lys 


He 
60 


Asn 


Ala 


Pro 


Pro 


30 


Pro 
65 


Asn 


Leu 


His 


Thr 


Gly 
70 


Asp 


Phe 


Pro Asn 


Pro 
75 


Asn 


Asp 


Ala 


Tyr 


Lys 
80 




Asn 


Leu 


He 


Asn 


Leu 

85 


Lys 


Pro 


Ala 


He Glu 
90 


Ala 


Gly 


Tyr 


Thr 


Gly 
95 


Arg 


35 


Gly 


val 


Glu 


val 
100 


Gly 


lie 


val 


Asp 


Thr Gly 
105 


Glu 


ser 


val 


Gly 
110 


ser 


He 


Ser 


Phe 


Pro 
115 


Glu 


Leu 


Tyr 


Gly 


Arg 
120 


Lys Glu 


His 


Gly 


Tyr 
125 


Asn 


Glu 


Asn 


40 


Tyr 


Lys 
130 


Asn 


Tyr 


Thr 


Ala 


Tyr 
135 


Met 


Arg Lys 


Glu 


Ala 
140 


Pro 


Glu 


Asp 


Gly 


Gly 
145 


Gly 


Lys 


ASp 


He 


Glu 
150 


Ala 


Ser 


Phe Asp 


Asp 
155 


Glu 


Ala 


val 


He 


Glu 

160 




Thr 


Glu 


Ala 


Lys 


Pro 
165 


Thr 


Asp 


He 


Arg His 
170 


val 


Lys 


Glu 


He 


Gly 
175 


His 


45 


He 


Asp 


Leu 


val 
180 


Ser 


His 


He 


He 


Gly Gly 
185 


Arg 


ser 


val 


ASp 

190 


Gly 


Arg 




pro 


Ala 


Gly 
195 


Gly 


He 


Ala 


Pro 


ASp 

200 


Ala Thr 


Leu 


His 


lie 

205 


Met 


Asn 


Thr 


50 


Asn 


ASp 

210 


Glu 


Thr 


Lys 


Asn 


Glu 
215 


Met 


Met Val 


Ala 


Ala 
220 


He 


Arg 


Asn 


Ala 




Trp 

225 


val 


Lys 


Leu 


Gly 


Glu 
230 


Arg 


Gly 


Val Arg 


He 
235 


val 


Asn 


Asn 


Ser 


Phe 
240 


55 


Gly 


Thr 


Thr 


Ser 


Arg 
245 


Ala 


Gly 


Thr 


Ala Asp 
250 


Leu 


Phe 


Gin 


He 


Ala 
255 


Asn 
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ser Glu Glu 

Lys Thr Asp 
275 

Leu Ser Tyr 
290 

Gly Asn Asp 
305 

Tyr Glu Lys 
Arg Ser Gly 

Glu Pro Leu 

355 

Cys Leu ser 
370 

pro lie Gin 

385 

Thr Ala Ala 

Leu Arg Thr 

val Asp Ser 
435 

Asn Gly Pro 
450 

Gly Thr Ser 
465 

Gly Gly Leu 

Asn Thr Tyr 

Tyr Gly Asn 
515 

lie Tyr Asn 
530 

Val Tyr Leu 
545 

He Lys Gly 
Leu Gly Lys 



Leu Tyr Met 
595 

Gly Arg Arg 
610 



Gin Tyr 
260 

Glu Gly 
His lie 
Ala Gin 

Asp Ala 

325 

Glu Lys 
340 

Glu Tyr 

Ala Pro 

lie Ala 

Leu Leu 
405 

Thr Leu 
420 

Lys Phe 
Ala Ser 
Asp lie 

lie Lys 

485 

Thr Gly 
500 

Asn Lys 

Gly Ala 

Ala Asp 

Ser Leu 
565 

Leu Leu 
580 

Ser Ala 
val Pro 



Arg Gin 

lie Arg 

Arg Asn 
295 

Ala Gin 
310 

Gin Lys 

Phe Lys 

Gly Ser 

Tyr Glu 
375 

Gly Thr 
390 

Leu Gin 

Leu Thr 

Gly Trp 

Phe Pro 
455 

Ala Tyr 
470 

Lys Gly 

Lys Thr 

Ser Asp 

Ala Ser 
535 

Thr Asp 
550 

Gin Leu 
Lys val 
Arg Gly 



phe Leu 
615 



Ala Leu 
265 

Leu Met 
280 

Lys Asn 

Pro Asn 

Gly He 

Arg Glu 
345 

Asn His 
360 

Ala Ser 
ser Phe 
Lys Tyr 

Thr Ala 

425 

Gly Leu 
440 

Phe Gly 

Ser Phe 

Gly Ser 

lie lie 
505 

Met Arg 
520 

Gly Gly 
Gin Ser 
Asp Gly 



Asp Gly 
585 

Lys Gly 
600 

Ser Ala 



Leu Asp Tyr 

Gin Gin Ser 

Met Leu Phe 
300 

Thr Tyr Ala 
315 

lie Thr Val 
330 

Met Tyr Gly 

Cys Gly lie 

val Arg Phe 
380 

ser Ala pro 
395 

pro Trp Met 
410 

Gin Asp lie 

Leu Asp Ala 

Asp Phe Thr 
460 

Arg Asn Asp 
475 

Gin Leu Gin 
490 

Glu Gly Gly 

Val Glu Thr 

Ser Leu Asn 
540 

Gly Ala Asn 
555 

Lys Gly Thr 
570 

Thr Ala lie 
Ala Gly Tyr 



Ala Lys lie 
620 



Ser Gly Gly Asp 
270 

Asp Tyr Gly Asn 
285 

lie Phe Ser Thr 



Leu Leu 

Ala Gly 

Glu Pro 
350 

Thr Ala 

365 

Thr Arg 

lie val 

Ser Asn 

Gly Ala 
430 

Gly Lys 
445 

Ala Asp 

lie Ser 

Leu His 

Ser Leu 
510 

Lys Gly 
525 

Ser Asp 
Glu Thr 
Leu Tyr 



Pro Phe 
320 

val Asp 
335 

Gly Thr 

Met Trp 

Thr Asn 

Thr Gly 
400 

Asp Asn 
415 

Val Gly 

Ala Met 

Thr Lys 

Gly Thr 
480 

Gly Asn 
495 

val Leu 
Ala Leu 
Gly lie 



val His 
560 

Thr Arg 
575 



lie Gly 
590 

Leu Asn 
605 



Gly Lys 
Ser Thr 
Gly Gin Asp Tyr 
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ser phe 
625 

Asp ser 

Tyr val 

ser Ala 

Leu Glu 
690 

Pro Glu 
705 

lie Arg 

Ala Asn 

val Tyr 

Leu Lys 
770 

val lie 
785 

val Glu 

Lys Thr 

Ser Thr 

Leu Phe 
850 

Gly Leu 
865 

Gly Ala 

Leu Gly 

Leu Thr 

Phe Ala 
930 

Glu Gly 
945 

Ser Asp 
Asn Gly 



Phe Thr 

val Glu 

Arg Arg 
660 

Pro Ala 
675 

Asn Leu 

Thr val 

Pro Tyr 

Ala Ala 
740 

Ala Asp 
755 

Ala val 

Ala Gin 

Gly Lys 

Gly Glu 
820 

Trp Ser 
835 

Ala Gly 

Phe Ser 

Asp Glu 

Ala Leu 
900 

val Glu 
915 

Glu Lys 
Thr Leu 
Lys Ala 



Asn lie 
630 

Lys Thr 
645 

Gly Asn 

Gly Leu 

Met Val 

Glu Thr 
710 

Gly Ala 
725 

Asp Gly 

Ser Thr 

Ser Asp 

Thr Gin 
790 

Met Arg 
805 

Asn Thr 

Glu Asn 

lie Arg 

Tyr Gly 
870 

His Ala 
885 

Gly Gly 
Gly Gly 
Gly Ser 



Arg Asp 
980 



val Gly 
950 

val Leu 
965 

Tyr Thr 



Glu Thr Asp 

Ala Gly Ser 

Ala Ala Arg 
665 

Lys His Ala 
680 

Glu Leu Asp 
695 

Ala Ala Ala 

Thr Phe Arg 

val Arg lie 
745 

Ala Ala His 
760 

Gly Leu Asp 
775 

Gin Asp Gly 

Gly ser Thr 

Thr Ala Ala 
825 

Ser Ala Asn 
840 

His Asp Ala 
855 

Arg Tyr Lys 

Glu Gly Ser 

val Asn val 
905 

Leu Arg Tyr 
920 

Ala Leu Gly 
935 

Leu Ala Gly 
Phe Ala Thr 



Val Thr Gly 
985 



Gly Gly Leu 
635 

Glu Gly Asp 
650 

Thr Ala Ser 

Val Glu Gin 

Ala Ser Glu 
700 

Asp Arg Thr 
715 

Ala Ala Ala 
730 

Phe Asn Ser 

Ala Asp Met 

His Asn Gly 
780 

Gly Thr Trp 
795 

Gin Thr val 
810 

Ala Thr Leu 

Ala Lys Thr 

Gly Asp lie 
860 

Asn Ser He 
875 

val Asn Gly 
890 

Pro Phe Ala 
Asp Leu Leu 



Leu Ala ser Leu 
640 

Thr Leu Ser Tyr 
655 

Ala Ala Ala His 
670 

Gly Gly Ser Asn 
685 

Ser Ser Ala Thr 



Asp Met 

Ala val 

Leu Ala 
750 

Gin Gly 
765 

Thr Gly 

Glu Gin 

Gly lie 

Gly Met 
830 

Asp Ser 
845 

Gly Tyr 
Ser Arg 
Thr Leu 



Pro Gly 
720 

Gin His 
735 

Ala Thr 

Arg Arg 

Leu Arg 

Gly Gly 
800 

Ala Ala 
815 

Gly Arg 
lie Ser 
Leu Lys 



Ser Thr 
880 

Met Gin 
895 



Ala Thr 
910 



Trp Ser Gly 
940 

Leu Lys Leu 
955 

Ala Gly Val 
970 

Gly Phe Thr 



Gly Asp 
Asp Ala 
Asn Ser Leu Thr 



Lys Gin 

925 



ser Gin Pro Leu 
960 

Glu Arg Asp Leu 
975 

Gly Ala Thr Ala 
990 
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Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu val 
995 1000 1005 

Ala Gly Leu Gly Ala Asp val Glu Phe Gly Asn Gly Trp Asn Gly Leu 
1010 1015 1020 

Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn His Ser Gly 
1025 1030 1035 1040 

Arg Val Gly val Gly Tyr Arg Phe Leu Glu Gly Gly Gly Gly Thr Gly 
1045 1050 1055 

Ser Ala Thr Asn Asp Asp Asp Val Lys Lys Ala Ala Thr Val Ala lie 
1060 1065 1070 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly 
1075 1080 1085 

Glu Thr lie Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp 
1090 1095 1100 

Ala Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 
1105 1110 1115 1120 

Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gin 
1125 1130 1135 

Asn val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu lie Glu Lys Leu 
1140 1145 1150 

Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 
1155 1160 1165 

Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie 
1170 1175 1180 

Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn He Val Lys lie Asp Glu 
1185 1190 1195 1200 

Lys Leu Glu Ala val Ala Asp Thr val Asp Lys His Ala Glu Ala Phe 
1205 1210 1215 

Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 
1220 1225 1230 

Ala Val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys 
1235 1240 1245 

Gin Asn val Asp Ala Lys val Lys Ala Ala Glu Thr Ala Ala Gly Lys 
1250 1255 1260 

Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 
1265 1270 1275 1280 

Ala val Ala Ala Lys val Thr Asp lie Lys Ala Asp He Ala Thr Asn 
1285 1290 1295 

Lys Asp Asn lie Ala Lys Lys Ala Asn ser Ala Asp val Tyr Thr Arg 
1300 1305 1310 

Glu Glu Ser Asp Ser Lys Phe Val Arg lie Asp Gly Leu Asn Ala Thr 
1315 1320 1325 

Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser lie Ala 
1330 1335 1340 

Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr val Ser Asp Leu 
1345 1350 1355 1360 
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Arg Lys Glu Thr Arg Gin Gly Leu Ala Glu Gin Ala Ala Leu ser Gly 
1365 1370 1375 

Leu Phe Gin Pro Tyr Asn Val Gly Arg Phe Asn Val Thr Ala Ala Val 
1380 1385 1390 

Gly Gly Tyr Lys Ser Glu ser Ala val Ala lie Gly Thr Gly Phe Arg 
1395 1400 1405 

Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala Val Gly Thr ser 
1410 1415 1420 

ser Gly ser ser Ala Ala Tyr His val Gly val Asn Tyr Glu Trp Leu 
1425 1430 1435 1440 

Glu His His His His His His 
1445 



20 



<210> 106 

<211> 4179 

<212> DNA 

<213> Artificial 



<220> 
<223> 



Sequence 



deltaG983-961c 



25 



30 



35 



40 



45 



50 



55 



<400> 106 

atgacttctg cgcccgactt caatgcaggc ggtaccggta tcggcagcaa cagcagagca 60 

acaacagcga aatcagcagc agtatcttac gccggtatca agaacgaaat gtgcaaagac 120 

agaagcatgc tctgtgccgg tcgggatgac gttgcggtta cagacaggga tgccaaaatc 180 

aatgcccccc ccccgaatct gcataccgga gactttccaa acccaaatga cgcatacaag 240 

aatttgatca acctcaaacc tgcaattgaa gcaggctata caggacgcgg ggtagaggta 300 

ggtatcgtcg acacaggcga atccgtcggc agcatatcct ttcccgaact gtatggcaga 360 

aaagaacacg gctataacga aaattacaaa aactatacgg cgtatatgcg gaaggaagcg 420 

cctgaagacg gaggcggtaa agacattgaa gcttctttcg acgatgaggc cgttatagag 480 

actgaagcaa agccgacgga tatccgccac gtaaaagaaa tcggacacat cgatttggtc 540 

tcccatatta ttggcgggcg ttccgtggac ggcagacctg caggcggtat tgcgcccgat 600 

gcgacgctac acataatgaa tacgaatgat gaaaccaaga acgaaatgat ggttgcagcc 660 

atccgcaatg catgggtcaa gctgggcgaa cgtggcgtgc gcatcgtcaa taacagtttt 720 

ggaacaacat cgagggcagg cactgccgac cttttccaaa tagccaattc ggaggagcag 780 

taccgccaag cgttgctcga ctattccggc ggtgataaaa cagacgaggg tatccgcctg 840 

atgcaacaga gcgattacgg caacctgtcc taccacatcc gtaataaaaa catgcttttc 900 

atcttttcga caggcaatga cgcacaagct cagcccaaca catatgccct attgccattt 960 

tatgaaaaag acgctcaaaa aggcattatc acagtcgcag gcgtagaccg cagtggagaa 1020 

aagttcaaac gggaaatgta tggagaaccg ggtacagaac cgcttgagta tggctccaac 1080 

cattgcggaa ttactgccat gtggtgcctg tcggcaccct atgaagcaag cgtccgtttc 1140 

acccgtacaa acccgattca aattgccgga acatcctttt ccgcacccat cgtaaccggc 1200 

acggcggctc tgctgctgca gaaatacccg tggatgagca acgacaacct gcgtaccacg 1260 

ttgctgacga cggctcagga catcggtgca gtcggcgtgg acagcaagtt cggctgggga 1320 

ctgctggatg cgggtaaggc catgaacgga cccgcgtcct ttccgttcgg cgactttacc 1380 

gccgatacga aaggtacatc cgatattgcc tactccttcc gtaacgacat ttcaggcacg 1440 

ggcggcctga tcaaaaaagg cggcagccaa ctgcaactgc acggcaacaa cacctatacg 1500 

ggcaaaacca ttatcgaagg cggttcgctg gtgttgtacg gcaacaacaa atcggatatg 1560 

cgcgtcgaaa ccaaaggtgc gctgatttat aacggggcgg catccggcgg cagcctgaac 1620 

agcgacggca ttgtctatct ggcagatacc gaccaatccg gcgcaaacga aaccgtacac 1680 

atcaaaggca gtctgcagct ggacggcaaa ggtacgctgt acacacgttt gggcaaactg 1740 

ctgaaagtgg acggtacggc gattatcggc ggcaagctgt acatgtcggc acgcggcaag 1800 

ggggcaggct atctcaacag taccggacga cgtgttccct tcctgagtgc cgccaaaatc 1860 

gggcaggatt attctttctt cacaaacatc gaaaccgacg gcggcctgct ggcttccctc 1920 

gacagcgtcg aaaaaacagc gggcagtgaa ggcgacacgc tgtcctatta tgtccgtcgc 1980 

ggcaatgcgg cacggactgc ttcggcagcg gcacattccg cgcccgccgg tctgaaacac 2040 

gccgtagaac agggcggcag caatctggaa aacctgatgg tcgaactgga tgcctccgaa 2100 

tcatccgcaa cacccgagac ggttgaaact gcggcagccg accgcacaga tatgccgggc 2160 

atccgcccct acggcgcaac tttccgcgca gcggcagccg tacagcatgc gaatgccgcc 2220 

gacggtgtac gcatcttcaa cagtctcgcc gctaccgtct atgccgacag taccgccgcc 2280 

catgccgata tgcagggacg ccgcctgaaa gccgtatcgg acgggttgga ccacaacggc 2340 

acgggtctgc gcgtcatcgc gcaaacccaa caggacggtg gaacgtggga acagggcggt 2400 

gttgaaggca aaatgcgcgg cagtacccaa accgtcggca ttgccgcgaa aaccggcgaa 2460 

aatacgacag cagccgccac actgggcatg ggacgcagca catggagcga aaacagtgca 2520 



170 



EP 1 790 660 A2 



10 



15 



20 



25 



aatgcaaaaa ccgacagcat tagtctgttt gcaggcatac 
ggctatctca aaggcctgtt ctcctacgga cgctacaaaa 
QQtgcggacg aacatgcgga aggcagcgtc aacggcacgc 
ggcggtgtca acgttccgtt tgccgcaacg ggagatttga 
tacgacctgc tcaaacagga tgcattcgcc gaaaaaggca 
aacagcctca ctgaaggcac gctggtcgga ctcgcgggtc 
agcgataaag ccgtcctgtt tgcaacggcg ggcgtggaac 
tacacggtaa cgggcggctt taccggcgcg actgcagcaa 
aatatgccgc acacccgtct ggttgccggc ctgggcgcgg 
tggaacggct tggcacgtta cagctacgcc ggttccaaac 
cgagtcggcg taggctaccg gttcctcgag ggtggcggag 
gacgacgatg ttaaaaaagc tgccactgtg gccattgctg 
gaaatcaacg gtttcaaagc tggagagacc atctacgaca 
accaaaaaag acgcaactgc agccgatgtt gaagccgacg 
aaaaaagtcg tgactaacct gaccaaaacc gtcaatgaaa 
aaagtaaaag ctgcagaatc tgaaatagaa aagttaacaa 
gccgctttag cagatactga tgccgctctg gatgcaacca 
ggagaaaata taacgacatt tgctgaagag actaagacaa 
aaattagaag ccgtggctga taccgtcgac aagcatgccg 
gattcattgg atgaaaccaa cactaaggca gacgaagccg 
aaacagacgg ccgaagaaac caaacaaaac gtcgatgcca 
gcagcaggca aagccgaagc tgccgctggc acagctaata 
gctgtcgctg caaaagttac cgacatcaaa gctgatatcg 
gctaaaaaag caaacagtgc cgacgtgtac accagagaag 
agaattgatg gtctgaacgc tactaccgaa aaattggaca 
aaatccattg ccgatcacga tactcgcctg aacggtttgg 
cgcaaagaaa cccgccaagg ccttgcagaa caagccgcgc 
tacaacgtgg gtctcgagca ccaccaccac caccactga 

<210> 107 

<211> 1392 

<212> PRT 

<213> Artificial sequence 



ggcacgatgc 
acagcatcag 
tgatgcagct 
cggtcgaagg 
gtgctttggg 
tgaagctgtc 
gcgacctgaa 
ccggcaagac 
atgtcgaatt 
agtacggcaa 
gcactggatc 
ctgcctacaa 
ttgatgaaga 
actttaaagg 
acaaacaaaa 
ccaagttagc 
ccaacgcctt 
atatcgtaaa 
aagcattcaa 
tcaaaaccgc 
aagtaaaagc 
ctgcagccga 
ctacgaacaa 
agtctgacag 
cacgcttggc 
ataaaacagt 
tctccggtct 



gggcgatatc 
ccgcagcacc 
gggcgcactg 
cggtctgcgc 
ctggagcggc 
gcaacccttg 
cggacgcgac 
gggggcacgc 
cggcaacggc 
ccacagcgga 
cgccacaaac 
caatggccaa 
cggcacaatt 
tctgggtctg 
cgtcgatgcc 
agacactgat 
gaataaattg 
aattgatgaa 
cgatatcgcc 
caatgaagcc 
tgcagaaact 
caaggccgaa 
agataatatt 
caaatttgtc 
ttctgctgaa 
gtcagacctg 
gttccaacct 



2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4179 



30 



<220> 
<223> 



deltaG983-961c 



35 



40 



45 



50 



<400> 107 

Met Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly lie Gly ser 
1 5 10 15 

Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val Ser Tyr Ala Gly 
20 25 30 

lie Lys Asn Glu Met cys Lys Asp Arg ser Met Leu cys Ala Gly Arg 
35 40 45 

Asp Asp Val Ala Val Thr Asp Arg Asp Ala Lys lie Asn Ala Pro Pro 
50 55 60 

Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys 
65 70 75 80 

Asn Leu lie Asn Leu Lys Pro Ala lie Glu Ala Gly Tyr Thr Gly Arg 
85 90 95 

Gly Val Glu val Gly lie Val Asp Thr Gly Glu Ser val Gly Ser lie 
100 105 110 

Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn 
115 120 125 

Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly 
130 135 140 



55 



Gly Gly Lys Asp lie Glu Ala Ser Phe Asp Asp 
145 150 155 



val lie Glu 
160 



Glu Ala 

Thr Glu Ala Lys Pro Thr Asp He Arg His Val Lys Glu lie Gly His 
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165 170 175 

lie Asp Leu val Ser His lie He Gly Gly Arg Ser Val Asp Gly Arg 
180 185 190 

Pro Ala Gly Gly lie Ala Pro Asp Ala Thr Leu His lie Met Asn Thr 
195 200 205 

Asn Asp Glu Thr Lys Asn Glu Met Met val Ala Ala lie Arg Asn Ala 
210 215 220 

Trp Val Lys Leu Gly Glu Arg Gly Val Arg lie Val Asn Asn Ser Phe 
225 230 235 240 

Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gin lie Ala Asn 
245 250 255 

ser Glu Glu Gin Tyr Arg Gin Ala Leu Leu Asp Tyr ser Gly Gly Asp 
260 265 270 

Lys Thr Asp Glu Gly lie Arg Leu Met Gin Gin Ser Asp Tyr Gly Asn 
275 280 285 

Leu Ser Tyr His lie Arg Asn Lys Asn Met Leu Phe lie Phe Ser Thr 
290 295 300 

Gly Asn Asp Ala Gin Ala Gin Pro Asn Thr Tyr Ala Leu Leu Pro Phe 
305 310 315 320 

Tyr Glu Lys Asp Ala Gin Lys Gly lie lie Thr val Ala Gly val Asp 
325 330 335 

Arg ser Gly Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr 
340 345 350 

Glu Pro Leu Glu Tyr Gly Ser Asn His Cys Gly lie Thr Ala Met Trp 
355 360 365 

Cys Leu Ser Ala Pro Tyr Glu Ala Ser Val Arg Phe Thr Arg Thr Asn 
370 375 380 

Pro lie Gin lie Ala Gly Thr ser Phe ser Ala Pro lie val Thr Gly 
385 390 395 400 

Thr Ala Ala Leu Leu Leu Gin Lys Tyr Pro Trp Met Ser Asn Asp Asn 
405 410 415 

Leu Arg Thr Thr Leu Leu Thr Thr Ala Gin Asp lie Gly Ala val Gly 
420 425 430 

Val Asp Ser Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met 
435 ' 440 445 

Asn Gly Pro Ala ser phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys 
450 455 460 

Gly Thr ser Asp lie Ala Tyr ser Phe Arg Asn Asp lie Ser Gly Thr 
465 470 475 480 

Gly Gly Leu lie Lys Lys Gly Gly Ser Gin Leu Gin Leu His Gly Asn 
485 490 495 

Asn Thr Tyr Thr Gly Lys Thr lie lie Glu Gly Gly Ser Leu val Leu 
500 505 510 

Tyr Gly Asn Asn Lys Ser Asp Met Arg val Glu Thr Lys Gly Ala Leu 
515 520 525 

lie Tyr Asn Gly Ala Ala Ser Gly Gly Ser Leu Asn ser Asp Gly lie 
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530 

val Tyr 
545 

lie Lys 

Leu Gly 

Leu Tyr 

Gly Arg 
610 

Ser Phe 
625 

Asp Ser 

Tyr val 

Ser Ala 

Leu Glu 
690 

Pro Glu 
705 

lie Arg 

Ala Asn 

Val Tyr 

Leu Lys 
770 

val lie 

785 

Val Glu 
Lys Thr 
Ser Thr 



Leu Phe 
850 

Gly Leu 
865 



Leu Ala Asp 

Gly Ser Leu 
565 

Lys Leu Leu 
580 

Met ser Ala 
595 

Arg Val Pro 

Phe Thr Asn 

val Glu Lys 
645 

Arg Arg Gly 
660 

Pro Ala Gly 
675 

Asn Leu Met 

Thr Val Glu 

Pro Tyr Gly 
725 

Ala Ala Asp 
740 

Ala Asp Ser 
755 

Ala Val Ser 

Ala Gin Thr 

Gly Lys Met 
805 

Gly Glu Asn 
820 

Trp Ser Glu 
835 

Ala Gly He 
Phe ser Tyr 



Gly Ala 
Leu Gly Ala Leu Gly 



Asp Glu His 
885 



535 

Thr Asp 
550 

Gin Leu 

Lys Val 

Arg Gly 

Phe Leu 
615 

He Glu 

630 

Thr Ala 

Asn Ala 

Leu Lys 

val Glu 
695 

Thr Ala 

710 

Ala Thr 

Gly Val 

Thr Ala 

Asp Gly 
775 

Gin Gin 
790 

Arg Gly 
Thr Thr 

Asn Ser 

Arg His 
855 

Gly Arg 
870 

Ala Glu 
Gly val 



Gin Ser 

Asp Gly 

Asp Gly 
585 

Lys Gly 
600 

Ser Ala 

Thr Asp 

Gly Ser 

Ala Arg 
665 

His Ala 
680 

Leu Asp 
Ala Ala 
Phe Arg 

Arg lie 

745 

Ala His 
760 

Leu Asp 
Asp Gly 
Ser Thr 



Ala Ala 

825 

Ala Asn 
840 



Gly Ala 
555 

Lys Gly 
570 

Thr Ala 

Ala Gly 

Ala Lys 

Gly Gly 
635 

Glu Gly 

650 

Thr Ala 

Val Glu 

Ala ser 

Asp Arg 
715 

Ala Ala 
730 

Phe Asn 
Ala Asp 
His Asn 

Gly Thr 

795 

Gin Thr 
810 

Ala Thr 
Ala Lys 
Gly Asp 



Asp Ala 
Tyr Lys 
Gly Ser 
Asn Val Pro Phe 



540 

Asn Glu Thr 

Thr Leu Tyr 

He lie Gly 
590 

Tyr Leu Asn 
605 

lie Gly Gin 
620 

Leu Leu Ala 

Asp Thr Leu 

ser Ala Ala 
670 

Gin Gly Gly 
685 

Glu ser ser 
700 

Thr Asp Met 

Ala Ala val 

Ser Leu Ala 
750 

Met Gin Gly 
765 

Gly Thr Gly 
780 

Trp Glu Gin 
val Gly He 



Leu Gly Met 

830 

Thr Asp Ser 
845 

lie Gly Tyr 
860 



Val His 
560 

Thr Arg 

575 

Gly Lys 

Ser Thr 

Asp Tyr 

Ser Leu 
640 

Ser Tyr 
655 

Ala His 

Ser Asn 

Ala Thr 

Pro Gly 
720 

Gin His 

735 

Ala Thr 

Arg Arg 

Leu Arg 

Gly Gly 
800 

Ala Ala 
815 

Gly Arg 
lie Ser 
Leu Lys 



Asn ser 
875 

Val Asn 
890 



ser Thr 
880 



lie ser Arg 
Gly Thr Leu 
Ala Ala Thr Gly Asp 



Met Gin 
895 
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900 905 910 

Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gin Asp Ala 
915 920 925 

Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn Ser Leu Thr 
930 935 940 

Glu Gly Thr Leu val Gly Leu Ala Gly Leu Lys Leu Ser Gin Pro Leu 
945 950 955 960 

ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly val Glu Arg Asp Leu 
965 970 975 

Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 
980 985 990 

Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val 
995 1000 1005 

Ala Gly Leu Gly Ala Asp val Glu Phe Gly Asn Gly Trp Asn Gly Leu 
1010 1015 1020 

Ala Arg Tyr ser Tyr Ala Gly ser Lys Gin Tyr Gly Asn His ser Gly 
1025 1030 1035 1040 

Arg Val Gly Val Gly Tyr Arg Phe Leu Glu Gly Gly Gly Gly Thr Gly 
1045 1050 1055 

Ser Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr val Ala lie 
1060 1065 1070 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly 
107 5 1080 1085 

Glu Thr lie Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp 
1090 1095 1100 

Ala Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 
1105 1110 1115 1120 

Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gin 
1125 1130 1135 

Asn val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu lie Glu Lys Leu 
1140 1145 1150 

Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 
1155 1160 1165 

Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie 
1170 1175 1180 

Thr Thr phe Ala Glu Glu Thr Lys Thr Asn lie val Lys lie Asp Glu 
1185 1190 1195 1200 

Lys Leu Glu Ala Val Ala Asp Thr val Asp Lys His Ala Glu Ala Phe 
1205 1210 1215 

Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 
1220 1225 1230 

Ala Val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys 
1235 1240 1245 

Gin Asn val Asp Ala Lys val Lys Ala Ala Glu Thr Ala Ala Gly Lys 
1250 1255 1260 

Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 
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<212> PRT 

<213> Artificial Sequence 
<220> 

<223> deltaG741-961 

<400> 109 

Met val Ala Ala Asp lie Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala 
1 5 10 15 

Pro Leu Asp His Lys Asp Lys Gly Leu Gin ser Leu Thr Leu Asp Gin 
20 25 30 

ser val Arg Lys Asn Glu Lys Leu Lys Leu Ala Ala Gin Gly Ala Glu 
35 40 45 

Lys Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn 
50 55 60 

Asp Lys val Ser Arg Phe Asp Phe lie Arg Gin lie Glu val Asp Gly 
65 70 75 80 

Gin Leu lie Thr Leu Glu ser Gly Glu Phe Gin val Tyr Lys Gin Ser 
85 90 95 

His Ser Ala Leu Thr Ala Phe Gin Thr Glu Gin lie Gin Asp Ser Glu 
100 105 110 

His Ser Gly Lys Met Val Ala Lys Arg Gin Phe Arg lie Gly Asp lie 
115 120 125 

Ala Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala 
130 13 5 140 

Thr Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu 
145 150 155 160 

Thr Tyr Thr lie Asp Phe Ala Ala Lys Gin Gly Asn Gly Lys lie Glu 
165 170 175 

His Leu Lys Ser Pro Glu Leu Asn val Asp Leu Ala Ala Ala Asp lie 
180 185 190 

Lys Pro Asp Gly Lys Arg His Ala Val lie Ser Gly Ser val Leu Tyr 
195 200 205 

Asn Gin Ala Glu Lys Gly Ser Tyr ser Leu Gly lie Phe Gly Gly Lys 
210 215 220 

Ala Gin Glu val Ala Gly Ser Ala Glu val Lys Thr val Asn Gly lie 
225 230 235 240 

Arg His lie Gly Leu Ala Ala Lys Gin Leu Glu Gly Gly Gly Gly Thr 
245 250 255 

Gly Ser Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr val Ala 
260 265 270 

lie Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala 
275 280 285 

Gly Glu Thr lie Tyr Asp He Asp Glu Asp Gly Thr lie Thr Lys Lys 
290 295 300 

Asp Ala Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly 
305 310 315 320 

Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr val Asn Glu Asn Lys 
325 330 335 
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Gin Asn val Asp Ala Lys val Lys Ala Ala Glu Ser Glu lie Glu Lys 
340 345 350 

Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp 
355 360 365 

Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn 
370 375 380 

lie Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn lie Val Lys lie Asp 
385 390 395 400 

Glu Lys Leu Glu Ala val Ala Asp Thr val Asp Lys His Ala Glu Ala 
405 410 415 

Phe Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp 
420 425 430 

Glu Ala val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr 
435 440 445 

Lys Gin Asn val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly 
450 455 460 

Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala 
465 470 475 480 

Glu Ala val Ala Ala Lys val Thr Asp lie Lys Ala Asp lie Ala Thr 
485 490 495 

Asn Lys Asp Asn lie Ala Lys Lys Ala Asn Ser Ala Asp Val Tyr Thr 
500 505 510 

Arg Glu Glu Ser Asp Ser Lys Phe Val Arg lie Asp Gly Leu Asn Ala 
515 520 525 

Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser lie 
530 535 540 

Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr val Ser Asp 
545 550 555 560 

Leu Arg Lys Glu Thr Arg Gin Gly Leu Ala Glu Gin Ala Ala Leu Ser 
565 570 575 

Gly Leu Phe Gin Pro Tyr Asn val Gly Arg Phe Asn Val Thr Ala Ala 
580 585 590 

Val Gly Gly Tyr Lys Ser Glu Ser Ala val Ala lie Gly Thr Gly Phe 
595 600 605 

Arg Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly val Ala val Gly Thr 
610 615 620 

Ser Ser Gly Ser Ser Ala Ala Tyr His Val Gly Val Asn Tyr Glu Trp 
625 ' 630 635 640 

Leu Glu His His His His His His 
645 

<210> 110 
<211> 1782 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> deltaG741-961c 
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145 

Thr Tyr Thr lie 



His Leu 
Lys Pro 

Asn Gin 

210 

Ala Gin 
225 

Arg His 

Gly Ser 

lie Ala 

Gly Glu 
290 

Asp Ala 
305 

Leu Lys 

Gin Asn 

Leu Thr 

Ala Ala 
370 

lie Thr 
385 

Glu Lys 
Phe Asn 
Glu Ala 



Lys Gin 
450 

Lys Ala 
465 



Lys Ser 
180 

Asp Gly 
195 

Ala Glu 

Glu val 

lie Gly 

Ala Thr 
260 

Ala Ala 

275 

Thr lie 

Thr Ala 

Lys Val 

Val Asp 
340 

Thr Lys 

355 

Leu Asp 

Thr Phe 

Leu Glu 

Asp lie 
420 

val Lys 
435 

Asn Val 
Glu Ala 
val Ala 



Glu Ala 
Asn Lys 
Arg Glu Glu ser 



Asp Asn 
500 



150 

Asp Phe Ala 
165 

Pro Glu Leu 
Lys Arg His 

Lys Gly Ser 

215 

Ala Gly ser 
230 

Leu Ala Ala 
245 

Asn Asp Asp 
Tyr Asn Asn 

Tyr Asp lie 

295 

Ala Asp val 
310 

val Thr Asn 
325 

Ala Lys Val 
Leu Ala Asp 

Ala Thr Thr 

375 

Ala Glu Glu 
390 

Ala Val Ala 
405 

Ala Asp ser 

Thr Ala Asn 

Asp Ala Lys 
455 

Ala Ala Gly 
470 

Ala Lys Val 
485 

lie Ala Lys 
Asp ser Lys 



Ala Lys Gin 
170 

Asn Val Asp 
185 

Ala val lie 
200 

Tyr Ser Leu 

Ala Glu val 

Lys Gin Leu 
250 

Asp val Lys 
265 

Gly Gin Glu 
280 

Asp Glu Asp 

Glu Ala Asp 

Leu Thr Lys 
330 

Lys Ala Ala 
345 

Thr Asp Ala 
360 

Asn Ala Leu 

Thr Lys Thr 

Asp Thr Val 
410 

Leu Asp Glu 
425 

Glu Ala Lys 
440 

val Lys Ala 
Thr Ala Asn 



155 

Gly Asn Gly Lys 



Leu Ala 

Ser Gly 

Gly lie 

220 

Lys Thr 
235 

Glu Gly 

Lys Ala 

lie Asn 

Gly Thr 
300 

Asp Phe 
315 

Thr val 

Glu Ser 

Ala Leu 

Asn Lys 
380 

Asn lie 
395 

Asp Lys 
Thr Asn 
Gin Thr 



Ala Glu 
460 

Thr Ala 
475 



Thr Asp lie 
490 

Lys Ala Asn 
505 



Ala Ala 
190 

ser val 
205 

Phe Gly 

val Asn 

Gly Gly 

Ala Thr 
270 

Gly Phe 
285 

lie Thr 

Lys Gly 

Asn Glu 

Glu lie 
350 

Ala Asp 
365 

Leu Gly 

val Lys 

His Ala 

Thr Lys 
430 

Ala Glu 
445 

Thr Ala 
Ala Asp 
Asp lie 



Lys Ala 
ser Ala 
Phe val Arg lie Asp Gly Leu 



Asp val 
510 



160 

lie Glu 
175 

Asp lie 

Leu Tyr 

Gly Lys 

Gly lie 
240 

Gly Thr 
255 

Val Ala 
Lys Ala 
Lys Lys 

Leu Gly 

320 

Asn Lys 
335 

Glu Lys 

Thr Asp 

Glu Asn 

lie Asp 
400 

Glu Ala 
415 

Ala Asp 

Glu Thr 

Ala Gly 

Lys Ala 
480 

Ala Thr 
495 

Tyr Thr 
Asn Ala 
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Lys Pro Asp 
195 

Asn Gin Ala 
210 

Ala Gin Glu 

225 

Arg His lie 

Gly Thr ser 

Asn Ser Arg 
275 

lie Lys Asn 
290 

Asp Asp val 
305 

Pro Asn Leu 
Asn Leu lie 

Gly val Glu 

355 

Ser Phe Pro 

370 

Tyr Lys Asn 
385 

Gly Gly Lys 

Thr Glu Ala 

lie Asp Leu 
435 

Pro Ala Gly 
450 

Asn Asp Glu 
465 

Trp Val Lys 
Gly Thr Thr 



Ser Glu Glu 
515 

Lys Thr Asp 

530 

Leu Ser Tyr 
545 



Gly Lys Arg His 
Glu Lys 
Val Ala 



Gly Ser 
215 



Gly Leu 
245 

Ala Pro 
260 

Ala Thr 

Glu Met 

Ala Val 

His Thr 
325 

Asn Leu 
340 

val Gly 

Glu Leu 

Tyr Thr 

Asp lie 
405 

Lys Pro 
420 

Val Ser 

Gly He 

Thr Lys 

Leu Gly 
485 

Ser Arg 
500 

Gin Tyr 
Glu Gly 
His lie 



Gly Ser 
230 

Ala Ala 

Asp Phe 

Thr Ala 

Cys Lys 
295 

Thr Asp 
310 

Gly Asp 

Lys Pro 

lie val 

Tyr Gly 
375 

Ala Tyr 
390 

Glu Ala 

Thr Asp 

His He 

Ala Pro 
455 

Asn Glu 
470 

Glu Arg 
Ala Gly 
Arg Gin 



lie Arg 
535 

Arg Asn 
550 



Ala val 
200 

Tyr Ser 
Ala Glu 
Lys Gin 

Asn Ala 

265 

Lys Ser 
280 

Asp Arg 
Arg Asp 
Phe Pro 

Ala lie 

345 

Asp Thr 
360 

Arg Lys 

Met Arg 

Ser Phe 

lie Arg 
425 

He Gly 
440 

Asp Ala 
Met Met 
Gly val 

Thr Ala 

505 

Ala Leu 

520 

Leu Met 
Lys Asn 



lie Ser Gly 

Leu Gly lie 

220 

Val Lys Thr 
235 

Leu Glu Gly 
250 

Gly Gly Thr 

Ala Ala Val 

ser Met Leu 
300 

Ala Lys lie 
315 

Asn Pro Asn 
330 

Glu Ala Gly 

Gly Glu Ser 

Glu His Gly 
380 

Lys Glu Ala 
395 

ASp Asp Glu 
410 

His val Lys 

Gly Arg Ser 

Thr Leu His 
460 

val Ala Ala 
475 

Arg lie val 
490 

Asp Leu Phe 

Leu Asp Tyr 

Gin Gin ser 
540 

Met Leu Phe 
555 



ser val 
205 

Phe Gly 
val Asn 
ser Gly 

Gly lie 

270 

Ser Tyr 
285 

Cys Ala 

Asn Ala 

Asp Ala 

Tyr Thr 
350 

val Gly 
365 

Tyr Asn 
Pro Glu 
Ala val 



Leu Tyr 

Gly Lys 

Gly lie 
240 

Gly Gly 
255 

Gly ser 

Ala Gly 

Gly Arg 

Pro Pro 
320 

Tyr Lys 
335 

Gly Arg 
Ser lie 
Glu Asn 



Asp Gly 
400 

lie Glu 
415 



Glu lie 
430 



Gly His 
Gly Arg 
lie Met Asn Thr 



Val Asp 
445 



lie Arg Asn Ala 
480 

Asn Asn Ser Phe 
495 

Gin lie Ala Asn 
510 

Ser Gly Gly Asp 
525 

Asp Tyr Gly Asn 



lie Phe Ser Thr 
560 
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Gly Asn Asp Ala 
Tyr Glu 
Arg Ser 



Lys Asp 
580 



Glu pro 
610 

Cys Leu 
625 

Pro lie 

Thr Ala 

Leu Arg 

val Asp 
690 

Asn Gly 
705 

Gly Thr 

Gly Gly 

Asn Thr 

Tyr Gly 
770 

lie Tyr 

785 

val Tyr 

lie Lys 

Leu Gly 

Leu Tyr 
850 

Gly Arg 
865 

Ser Phe 
Asp ser 
Tyr Val 



Gly Glu 
595 

Leu Glu 

ser Ala 

Gin lie 

Ala Leu 
660 

Thr Thr 

675 

Ser Lys 

Pro Ala 

Ser Asp 

Leu lie 
740 

Tyr Thr 

755 

Asn Asn 

Asn Gly 

Leu Ala 

Gly ser 
820 

Lys Leu 
835 

Met ser 
Arg val 
Phe Thr 



val Glu 
900 

Arg Arg 
915 



Gin Ala 
565 

Ala Gin 

Lys Phe 

Tyr Gly 

Pro Tyr 
630 

Ala Gly 
645 

Leu Leu 

Leu Leu 

Phe Gly 

Ser Phe 
710 

lie Ala 
725 

Lys Lys 

Gly Lys 

Lys Ser 

Ala Ala 
790 

Asp Thr 
805 

Leu Gin 

Leu Lys 

Ala Arg 

Pro Phe 
870 

Asn lie 
885 

Lys Thr 
Gly Asn 



Gin Pro 

Lys Gly 

Lys Arg 
600 

Ser Asn 
615 

Glu Ala 
Thr Ser 
Gin Lys 



Thr Thr 
680 

Trp Gly 

695 



Asn Thr 
570 

lie lie 

585 

Glu Met 
His Cys 
ser val 



Tyr Ala 
Thr val 
Tyr Gly 



Gly lie 
620 

Arg Phe 
635 



Leu Leu Pro Phe 
575 

Ala Gly val Asp 
590 

Glu Pro Gly Thr 
605 

Thr Ala Met Trp 



Phe Ser 
650 



Ala Pro 
Trp Met 
Ala Gin Asp lie 



Tyr Pro 
665 



Leu Leu 



Pro Phe Gly Asp 
Tyr Ser 
Gly Gly 



Phe Arg 
730 



Thr lie 
760 

Asp Met 
775 

Ser Gly 
Asp Gin 
Leu Asp 



Ser Gin 
745 

lie Glu 
Arg val 
Gly Ser 



Ser Gly 
810 

Gly Lys 
825 



val Asp 
840 

Gly Lys 
855 



Gly Thr 
Gly Ala 
Leu Ser Ala Ala 



Asp Ala 
700 

Phe Thr 
715 

Asn Asp 
Leu Gin 
Gly Gly 

Glu Thr 

780 

Leu Asn 
795 

Ala Asn 
Gly Thr 
Ala lie 



Gly Tyr 
860 

Lys lie 
875 



Thr Arg 

lie val 

Ser Asn 
670 

Gly Ala 
685 

Gly Lys 

Ala Asp 

lie Ser 

Leu His 
750 

ser Leu 
765 

Lys Gly 
Ser Asp 
Glu Thr 



Thr Asn 
640 

Thr Gly 

655 

Asp Asn 

val Gly 

Ala Met 

Thr Lys 
720 

Gly Thr 
735 

Gly Asn 
val Leu 
Ala Leu 



Gly lie 
800 

val His 
815 



Thr Arg 
Gly Lys 
Leu Asn Ser Thr 



Leu Tyr 
830 

lie Gly 
845 



Glu Thr 
Ala Gly 



Ala Ala 
920 



Asp Gly 
890 

Ser Glu 
905 



Gly Leu 
Gly Asp 
Arg Thr Ala Ser 



Gly Gin Asp Tyr 
880 

Leu Ala Ser Leu 
895 

Thr Leu Ser Tyr 
910 

Ala Ala Ala His 
925 
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Ser Ala Pro Ala Gly Leu Lys His Ala Val Glu Gin Gly Gly Ser Asn 
930 935 940 

Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu Ser Ser Ala Thr 
945 950 955 960 

Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly 
965 970 975 

lie Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala val Gin His 
980 985 990 

Ala Asn Ala Ala Asp Gly val Arg lie Phe Asn ser Leu Ala Ala Thr 
995 1000 1005 

Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gin Gly Arg Arg 
1010 1015 1020 

Leu Lys Ala val ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg 
1025 1030 1035 1040 

val lie Ala Gin Thr Gin Gin Asp Gly Gly Thr Trp Glu Gin Gly Gly 
1045 1050 1055 

Val Glu Gly Lys Met Arg Gly Ser Thr Gin Thr Val Gly lie Ala Ala 
1060 1065 1070 

Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg 
1075 1080 1085 

Ser Thr Trp Ser Glu Asn Ser Ala Asn Ala Lys Thr Asp Ser lie Ser 
1090 1095 1100 

Leu Phe Ala Gly lie Arg His Asp Ala Gly Asp lie Gly Tyr Leu Lys 
1105 1110 1115 1120 

Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser lie Ser Arg Ser Thr 
1125 1130 1135 

Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly Thr Leu Met Gin 
1140 1145 1150 

Leu Gly Ala Leu Gly Gly Val Asn Val Pro Phe Ala Ala Thr Gly Asp 
1155 1160 1165 

Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gin Asp Ala 
1170 1175 1180 

Phe Ala Glu Lys Gly ser Ala Leu Gly Trp ser Gly Asn ser Leu Thr 
1185 1190 1195 1200 

Glu Gly Thr Leu val Gly Leu Ala Gly Leu Lys Leu Ser Gin pro Leu 
1205 1210 1215 

Ser Asp Lys Ala val Leu Phe Ala Thr Ala Gly val Glu Arg Asp Leu 
1220 1225 1230 

Asn Gly Arg Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala 
1235 1240 1245 

Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu Val 
1250 1255 1260 

Ala Gly Leu Gly Ala Asp Val Glu Phe Gly Asn Gly Trp Asn Gly Leu 
1265 1270 1275 1280 

Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn His Ser Gly 
1285 1290 1295 
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Asp Lys 
65 

Gin Leu 

His Ser 

His Ser 

Ala Gly 
130 

Thr Tyr 
145 

Thr Tyr 

His Leu 

Lys Pro 

Asn Gin 
210 

Ala Gin 

225 

Arg His 
Gly ser 
Arg Gin 



val Ser 

lie Thr 

Ala Leu 
100 

Gly Lys 
115 

Glu His 

Arg Gly 

Thr lie 

Lys Ser 
180 

Asp Gly 
195 

Ala Glu 
Glu val 
lie Gly 



ser Asp 
260 

His Phe 

275 



Gly Glu 
290 

ser His 
305 



Leu Ala 
Gin Leu 
Asn lie Gly Tyr 



ser Pro 

Ser Pro 

Glu His 
370 

Pro Ala 

385 

val Ala 
Gin Arg 



Phe Asp 
340 

val Asp 
355 

His Pro 



pro Lys 
Gin Asn 



Leu Ala 
420 



Arg Phe Asp 
70 

Leu Glu Ser 
85 

Thr Ala Phe 

Met Val Ala 

Thr Ser Phe 
135 

Thr Ala Phe 
150 

Asp Phe Ala 
165 

Pro Glu Leu 

Lys Arg His 

Lys Gly ser 
215 

Ala Gly ser 
230 

Leu Ala Ala 
245 

Leu Ala Asn 

Glu Pro Asp 

Glu Arg ser 
295 

Gly Asn Leu 
310 

lie val Arg 
325 

Asn His Ala 
Gly Phe Ser 



Ala Asp Gly 
375 

Gly Ala Arg 

390 

lie Arg Leu 
405 

Asp Arg Phe 



Phe lie Arg 

Gly Glu Phe 
90 

Gin Thr Glu 
105 

Lys Arg Gin 
120 

Asp Lys Leu 

Gly Ser Asp 

Ala Lys Gin 
170 

Asn val Asp 
185 

Ala val lie 
200 

Tyr ser Leu 
Ala Glu val 

Lys Gin Leu 

250 

Asp Ser Phe 

265 

Gly Lys Tyr 
280 

Gly His lie 

Met lie Gin 

Phe Ser Asp 
330 

ser His ser 
345 

Leu Tyr Arg 
360 

Tyr Asp Gly 
Asp lie Tyr 



Asn Leu Thr 
410 

His Asn Ala 
425 



Gin lie Glu 
75 

Gin val Tyr 

Gin lie Gin 

Phe Arg lie 
125 

Pro Glu Gly 
140 

Asp Ala Gly 
155 

Gly Asn Gly 

Leu Ala Ala 

ser Gly ser 
205 

Gly lie Phe 
220 

Lys Thr val 

235 

Asp Gly Gly 
lie Arg Gin 

His Leu Phe 

285 

Gly Leu Gly 
300 

Gin Ala Ala 

315 

His Gly His 
Asp Ser Asp 

lie His Trp 

365 

Pro Gin Gly 
380 

Ser Tyr Asp 
395 

Asp Asn Arg 
Gly ser Met 



val Asp Gly 
80 

Lys Gin Ser 
95 

Asp Ser Glu 
110 

Gly Asp lie 

Gly Arg Ala 

Gly Lys Leu 
160 

Lys lie Glu 
175 

Ala Asp lie 
190 

val Leu Tyr 

Gly Gly Lys 

Asn Gly lie 
240 

Gly Gly Thr 
255 

val Leu Asp 

270 

Gly Ser Arg 

Lys lie Gin 

lie Lys Gly 
320 

Glu val His 
335 

Glu Ala Gly 
350 

Asp Gly Tyr 
Gly Gly Tyr 



lie Lys Gly 
400 

Ser Thr Gly 
415 

Leu Thr Gin 
430 
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Gly Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr Ser Pro Glu Leu 
435 440 445 

Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp lie 
450 455 460 

val Lys Asn lie lie Gly Ala Ala Gly Glu lie Val Gly Ala Gly Asp 
465 470 475 480 

Ala val Gin Gly lie Ser Glu Gly Ser Asn lie Ala val Met His Gly 
485 490 495 

Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg lie Asn Asp Leu 
500 505 ~ 510 

Ala Asp Met Ala Gin Leu Lys Asp Tyr Ala Ala Ala Ala lie Arg Asp 
515 520 525 

Trp Ala val Gin Asn Pro Asn Ala Ala Gin Gly lie Glu Ala Val Ser 
530 535 540 

Asn lie Phe Met Ala Ala lie Pro lie Lys Gly lie Gly Ala val Arg 
545 550 555 560 

Gly Lys Tyr Gly Leu Gly Gly lie Thr Ala His pro lie Lys Arg ser 
565 570 575 

Gin Met Gly Ala lie Ala Leu Pro Lys Gly Lys Ser Ala Val Ser Asp 
580 585 590 

Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser 
595 600 605 

Arg Asn lie Arg Ser Asn Leu Glu Gin Arg Tyr Gly Lys Glu Asn lie 
610 615 620 

Thr Ser Ser Thr val Pro Pro Ser Asn Gly Lys Asn Val Lys Leu Ala 
625 630 635 640 

Asp Gin Arg His Pro Lys Thr Gly val Pro Phe Asp Gly Lys Gly Phe 
645 650 655 

Pro Asn Phe Glu Lys His Val Lys Tyr Asp Thr Leu Glu His His His 
660 665 670 

His His His 
675 

<210> 116 
<211> 249 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Novel protein 
<400> 116 

Met Lys Lys Tyr Leu Phe Arg Ala Ala Leu Tyr Gly lie Ala Ala Ala 
15 10 15 

lie Leu Ala Ala Ala lie Pro Ala Gly Asn Asp Ala Thr Thr Lys Pro 
20 25 30 

Asp Leu Tyr Tyr Leu Lys Asn Glu Gin Ala lie Asp ser Leu Lys Leu 
35 40 45 

Leu Pro Pro Pro Pro Glu val Gly Ser lie Gin Phe Leu Asn Asp Gin 
50 5 5 60 
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Ala Met Tyr Glu Lys Gly Arg Met Leu Arg Asn Thr Glu Arg Gly Lys 
65 70 75 ^ 80 

Gin Ala Gin Ala Asp Ala Asp Leu Ala Ala Gly Gly val Ala Thr Ala 
85 90 95 

Phe Ser Gly Ala Phe Gly Tyr Pro lie Thr Glu Lys Asp Ser Pro Glu 
100 105 110 

Leu Tyr Lys Leu Leu Thr Asn Met lie Glu Asp Ala Gly Asp Leu Ala 
115 120 125 

Thr Arg Ser Ala Lys Glu His Tyr Met Arg lie Arg Pro Phe Ala Phe 
130 135 140 

Tyr Gly Thr Glu Thr cys Asn Thr Lys Asp Gin Lys Lys Leu ser Thr 
145 150 155 160 

Asn Gly Ser Tyr Pro Ser Gly His Thr Ser lie Gly Trp Ala Thr Ala 
165 170 175 

Leu val Leu Ala Glu val Asn Pro Ala Asn Gin Asp Ala lie Leu Glu 
180 185 190 

Arg Gly Tyr Gin Leu Gly Gin Ser Arg val lie Cys Gly Tyr His Trp 
195 200 205 

Gin ser Asp val Asp Ala Ala Arg lie val Gly ser Ala Ala val Ala 
210 215 220 

Thr Leu His Ser Asp Pro Ala Phe Gin Ala Gin Leu Ala Lys Ala Lys 
225 230 235 240 

Gin Glu Phe Ala Gin Lys Ser Gin Lys 
245 

<210> 117 
<211> 66 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ll linker 

<220> 

<221> N 

<222> 13 

<223> A, T/U, G or C 

<400> 117 

tatgaartay ytnttymgcg ccgccctgta cggcatcgcc gccgccatcc tcgccgccgc 
gatccc 

<210> 118 

<211> 69 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> si linker 

<220> 

<221> n 

<222> 25, 28 

<223> A, T/U, G or C 

<400> 118 

tatgaaaaaa tacctattcc grgcngcnyt rtayggsatc gccgccgcca tcctcgccgc 
cgcgatccc 
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<210> 119 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 9Ll-a 

<400> 119 

atgaagaagt accttttcag cgccgcc 

<210> 120 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 9Ll-e 

<400> 120 

atgaaaaaat actttttccg cgccgcc 

<210> 121 

<211> 27 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> 9Ll-d 

<400> 121 

atgaaaaaat actttttccg cgccgcc 

<210> 122 

<211> 60 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> 9Ll-f 

<400> 122 

atgaaaaaat atctctttag cgccgccctg tacggcatcg ccgccgccat cctcgccgcc 

<210> 123 

<211> 60 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> 919sp 

<400> 123 

atgaaaaaat acctattccg cgccgccctg tacggcatcg ccgccgccat cctcgccgcc 

<210> 124 

<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 9Lla 

<400> 124 

Met Lys Lys Tyr Leu Phe Ser Ala Ala 



<210> 125 
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10 



20 



25 



30 



40 



45 



50 



<211> 9 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 9l_le 

<400> 125 

Met Lys Lys Tyr Phe Phe Arg Ala Ala 



<210> 126 

<211> 9 

<212> PRT 

<213> Artificial Sequence 



15 <220> 

<223> 9Lld 



<400> 126 

Met Lys Lys Tyr Phe Phe Arg Ala Ala 
1 5 

<210> 127 

<211> 20 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 9Llf 

<400> 127 

Met Lys Lys Tyr Leu Phe Ser Ala Ala Leu Tyr Gly lie Ala Ala Ala 
1 5 10 15 

lie Leu Ala Ala 

20 



<210> 128 

<211> 20 

<212> PRT 

35 <213> Artificial Sequence 

<220> 

<223> 9Llsp 



<400> 128 

Met Lys Lys Tyr Leu Phe Arg Ala Ala Leu Tyr Gly lie Ala Ala Ala 
1 5 10 15 

lie Leu Ala Ala 

20 

<210> 129 
<211> 42 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> 9Sl-e 



<400> 129 

atgaaaaaat acctattcat cgccgccgcc atcctcgccg cc 42 

<210> 130 

<211> 60 

55 <212> DNA 

<213> Artificial Sequence 
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<220> 

<223> 9S1-C 
<400> 130 

atgaaaaaat acctattccg agctgcccaa tacggcatcg ccgccgccat cctcgccgcc 

<210> 131 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 9Sl-b 

<400> 131 

atgaaaaaat acctattccg ggccgcccaa tacggcatcg ccgccgccat cctcgccgcc 

<210> 132 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 9Sl-i 

<400> 132 

atgaaaaaat acctattccg ggcggctttg tacgggatcg ccgccgccat cctcgccgcc 

<210> 133 
<211> 14 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 9Sle 
<400> 133 

Met Lys Lys Tyr Leu Phe lie Ala Ala Ala lie Leu Ala Ala 
15 10 

<210> 134 

<211> 20 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 9slc 

<400> 134 

Met Lys Lys Tyr Leu Phe Arg Ala Ala Gin Tyr Gly lie Ala Ala Ala 
1 5 10 15 

lie Leu Ala Ala 
20 

<210> 135 
<211> 20 
<212> PRT 

<213> Artificial sequence 
<220> 

<223> 9Slb 
<400> 135 

Met Lys Lys Tyr Leu Phe Arg Ala Ala Gin Tyr Gly lie Ala Ala Ala 
1 5 10 15 

He Leu Ala Ala 

20 
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<210> 136 

<211> 20 

<212> PRT 

<213> Artificial Sequence 



<223> 


9Sli 






Met Lys 
1 


136 
Lys 


Tyr 


Leu 
5 


lie Leu 


Ala 


Ala 
20 




<210> 
<211> 
<212> 
<213> 


137 
467 

PRT 

Arti fi cial 


<220> 
<223> 


730 






<400> 
val Lys 
1 


137 
Pro 


Leu 


Arg 
5 


Ala Ala 


Ala 


Ala 


Leu 



10 15 



10 15 

e Gin Pro Ala Leu Ala Ala Asp Leu Ala Gin 
20 25 30 

Asp Pro Phe lie Thr Asp Asn Ala Gin Arg Gin His Tyr Glu Pro Gly 
35 40 45 

Gly Lys Tyr His Leu Phe Gly Asp Pro Arg Gly Ser Val Ser Asp Arg 
50 55 60 

Thr Gly Lys lie Asn Val lie Gin Asp Tyr Thr His Gin Met Gly Asn 
65 70 75 80 

Leu Leu lie Gin Gin Ala Asn lie Asn Gly Thr lie Gly Tyr His Thr 
85 90 95 

Arg Phe ser Gly His Gly His Glu Glu His Ala Pro Phe Asp Asn His 
100 105 110 

Ala Ala Asp Ser Ala Ser Glu Glu Lys Gly Asn Val Asp Glu Gly Phe 
115 120 125 

Thr val Tyr Arg Leu Asn Trp Glu Gly His Glu His His Pro Ala Asp 
130 135 140 

Ala Tyr Asp Gly Pro Lys Gly Gly Asn Tyr Pro Lys Pro Thr Gly Ala 
145 150 155 160 

Arg Asp Glu Tyr Thr Tyr His val Asn Gly Thr Ala Arg Ser lie Lys 
165 170 175 

Leu Asn Pro Thr Asp Thr Arg Ser lie Arg Gin Arg lie Ser Asp Asn 
180 185 190 

Tyr Ser Asn Leu Gly Ser Asn Phe Ser Asp Arg Ala Asp Glu Ala Asn 
195 200 205 

Arg Lys Met Phe Glu His Asn Ala Lys Leu Asp Arg Trp Gly Asn Ser 

210 215 220 

Met Glu Phe He Asn Gly Val Ala Ala Gly Ala Leu Asn Pro Phe lie 
225 230 235 240 
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Ser Ala Gly Glu Ala Leu Gly lie Gly Asp lie Leu Tyr Gly Thr Arg 
245 250 255 

Tyr Ala lie Asp Lys Ala Ala Met Arg Asn lie Ala Pro Leu Pro Ala 
260 265 270 

Glu Gly Lys Phe Ala val lie Gly Gly Leu Gly Ser val Ala Gly Phe 
275 280 285 

Glu Lys Asn Thr Arg Glu Ala Val Asp Arg Trp lie Gin Glu Asn Pro 
290 295 300 

Asn Ala Ala Glu Thr val Glu Ala Val Phe Asn Val Ala Ala Ala Ala 
305 310 315 320 

Lys val Ala Lys Leu Ala Lys Ala Ala Lys Pro Gly Lys Ala Ala Val 
325 330 335 

Ser Gly Asp Phe Ala Asp Ser Tyr Lys Lys Lys Leu Ala Leu Ser Asp 
340 345 350 

Ser Ala Arg Gin Leu Tyr Gin Asn Ala Lys Tyr Arg Glu Ala Leu Asp 
355 360 365 

lie His Tyr Glu Asp Leu lie Arg Arg Lys Thr Asp Gly Ser Ser Lys 
370 375 380 

Phe lie Asn Gly Arg Glu lie Asp Ala Val Thr Asn Asp Ala Leu lie 
385 390 395 400 

Gin Ala Lys Arg Thr lie Ser Ala lie Asp Lys Pro Lys Asn Phe Leu 
405 410 415 

Asn Gin Lys Asn Arg Lys Gin lie Lys Ala Thr lie Glu Ala Ala Asn 
420 425 430 

Gin Gin Gly Lys Arg Ala Glu Phe Trp Phe Lys Tyr Gly val His Ser 
435 " 440 445 

Gin val Lys Ser Tyr lie Glu Ser Lys Gly Gly lie val Lys Thr Gly 
450 455 ' 460 

Leu Gly Asp 
465 

<210> 138 
<211> 377 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 730-C1 
<400> 138 

Met Ala Asp Leu Ala Gin Asp Pro Phe lie Thr Asp Asn Ala Gin Arg 
15 10 15 

Gin His Tyr Glu Pro Gly Gly Lys Tyr His Leu Phe Gly Asp Pro Arg 
20 25 30 

Gly ser val Ser Asp Arg Thr Gly Lys lie Asn val lie Gin Asp Tyr 
35 40 45 

Thr His Gin Met Gly Asn Leu Leu lie Gin Gin Ala Asn lie Asn Gly 
50 55 60 

Thr lie Gly Tyr His Thr Arg Phe Ser Gly His Gly His Glu Glu His 
65 70 75 80 
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Ala Pro Phe Asp Asn His Ala Ala Asp Ser Ala Ser Glu Glu Lys Gly 
85 90 95 

Asn Val Asp Glu Gly Phe Thr Val Tyr Arg Leu Asn Trp Glu Gly His 
100 105 110 

Glu His His Pro Ala Asp Ala Tyr Asp Gly Pro Lys Gly Gly Asn Tyr 
115 120 125 

Pro Lys Pro Thr Gly Ala Arg Asp Glu Tyr Thr Tyr His val Asn Gly 
130 135 140 

Thr Ala Arg Ser lie Lys Leu Asn Pro Thr Asp Thr Arg Ser lie Arg 
145 150 155 160 

Gin Arg lie Ser Asp Asn Tyr Ser Asn Leu Gly Ser Asn Phe Ser Asp 
165 170 175 

Arg Ala Asp Glu Ala Asn Arg Lys Met Phe Glu His Asn Ala Lys Leu 
180 185 190 

Asp Arg Trp Gly Asn Ser Met Glu Phe lie Asn Gly Val Ala Ala Gly 
195 200 205 

Ala Leu Asn Pro Phe lie Ser Ala Gly Glu Ala Leu Gly lie Gly Asp 
210 215 220 

lie Leu Tyr Gly Thr Arg Tyr Ala lie Asp Lys Ala Ala Met Arg Asn 
225 230 235 240 

lie Ala Pro Leu Pro Ala Glu Gly Lys Phe Ala val lie Gly Gly Leu 
245 250 255 

Gly Ser val Ala Gly Phe Glu Lys Asn Thr Arg Glu Ala val Asp Arg 
260 265 270 

Trp He Gin Glu Asn Pro Asn Ala Ala Glu Thr Val Glu Ala val Phe 
275 280 285 

Asn val Ala Ala Ala Ala Lys val Ala Lys Leu Ala Lys Ala Ala Lys 
290 295 300 

Pro Gly Lys Ala Ala Val Ser Gly Asp Phe Ala Asp Ser Tyr Lys Lys 
305 310 315 320 

Lys Leu Ala Leu Ser Asp Ser Ala Arg Gin Leu Tyr Gin Asn Ala Lys 
325 330 335 

Tyr Arg Glu Ala Leu Asp lie His Tyr Glu Asp Leu lie Arg Arg Lys 
340 345 350 

Thr Asp Gly Ser Ser Lys Phe lie Asn Gly Arg Glu lie Asp Ala val 
3 55 360 365 

Thr Asn Asp Ala Leu lie Gin Ala Arg 
370 375 

<210> 139 
<211> 353 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 730-C2 
<400> 139 

Met Ala Asp Leu Ala Gin Asp Pro Phe lie Thr Asp Asn Ala Gin Arg 
15 10 15 
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Gin His Tyr Glu Pro Gly Gly Lys Tyr His Leu Phe Gly Asp Pro Arg 
20 25 30 

Gly Ser val Ser Asp Arg Thr Gly Lys lie Asn val lie Gin Asp Tyr 
35 40 45 

Thr His Gin Met Gly Asn Leu Leu lie Gin Gin Ala Asn lie Asn Gly 
50 55 60 

Thr He Gly Tyr His Thr Arg Phe Ser Gly His Gly His Glu Glu His 
65 70 75 80 

Ala Pro Phe Asp Asn His Ala Ala Asp Ser Ala Ser Glu Glu Lys Gly 
85 90 95 

Asn Val Asp Glu Gly Phe Thr Val Tyr Arg Leu Asn Trp Glu Gly His 
100 105 110 

Glu His His Pro Ala Asp Ala Tyr Asp Gly Pro Lys Gly Gly Asn Tyr 
115 120 12 5 

pro Lys Pro Thr Gly Ala Arg Asp Glu Tyr Thr Tyr His val Asn Gly 
130 13 5 140 

Thr Ala Arg Ser lie Lys Leu Asn Pro Thr Asp Thr Arg Ser lie Arg 
145 150 155 160 

Gin Arg lie Ser Asp Asn Tyr Ser Asn Leu Gly Ser Asn Phe Ser Asp 
165 170 175 

Arg Ala Asp Glu Ala Asn Arg Lys Met Phe Glu His Asn Ala Lys Leu 
180 185 190 

Asp Arg Trp Gly Asn Ser Met Glu Phe lie Asn Gly val Ala Ala Gly 
195 200 205 

Ala Leu Asn Pro Phe lie Ser Ala Gly Glu Ala Leu Gly lie Gly Asp 
210 215 220 

lie Leu Tyr Gly Thr Arg Tyr Ala lie Asp Lys Ala Ala Met Arg Asn 
225 230 235 240 

lie Ala Pro Leu Pro Ala Glu Gly Lys Phe Ala val lie Gly Gly Leu 
245 250 255 

Gly Ser Val Ala Gly Phe Glu Lys Asn Thr Arg Glu Ala val Asp Arg 
260 265 270 

Trp lie Gin Glu Asn Pro Asn Ala Ala Glu Thr val Glu Ala val Phe 
275 280 285 

Asn Val Ala Ala Ala Ala Lys val Ala Lys Leu Ala Lys Ala Ala Lys 
290 295 300 

Pro Gly Lys Ala Ala val Ser Gly Asp Phe Ala Asp Ser Tyr Lys Lys 
305 310 315 320 

Lys Leu Ala Leu Ser Asp Ser Ala Arg Gin Leu Tyr Gin Asn Ala Lys 
325 330 335 

Tyr Arg Glu Ala Leu Gly Lys val Arg lie Ser Gly Glu lie Leu Leu 
340 345 350 

Gly 

<210> 140 
<211> 2019 
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<212> DNA 

<213> Artificial sequence 



10 



15 



20 



25 



30 



35 



40 



45 



50 



<220> 
<223> 



ORF46. 1-741 



<400> 140 
atgtcagatt tggcaaacga 
cccgacggga aataccacct 
atcggattgg gaaaaataca 
attaaaggaa atatcggcta 
cccttcgaca accatgcctc 
tttagccttt accgcatcca 
gggccacagg gcggcggcta 
ataaaaggcg ttgcccaaaa 
cggcttgccg accgtttcca 
ttcaaacgcg ccacccgata 
ttcaacggca ctgcagatat 
gcaggcgatg ccgtgcaggg 
ggtctgcttt ccaccgaaaa 
ctcaaagact atgccgcagc 
caaggcatag aagccgtcag 
gctgttcggg gaaaatacgg 
atgggcgcga tcgcattgcc 
gcatacgcca aatacccgtc 
cgttacggca aagaaaacat 
aaactggcag accaacgcca 
aattttgaga agcacgtgaa 
atcggtgcgg ggcttgccga 
cagtctttga cgctggatca 
ggtgcggaaa aaacttatgg 
aaggtcagcc gtttcgactt 
gagagtggag agttccaagt 
gagcaaatac aagattcgga 
ggcgacatag cgggcgaaca 
tatcgcggga cggcgttcgg 
ttcgccgcca agcagggaaa 
gacctggccg ccgccgatat 
gtcctttaca accaagccga 
caggaagttg ccggcagcgc 
gccgccaagc aactcgagca 



ttcttttatc 
attcggcagc 
aagccatcag 
cattgtccgc 
acattccgat 
ttgggacgga 
tcccgctccc 
tatccgcctc 
caatgccggt 
cagccccgag 
cgttaaaaac 
cataagcgaa 
caagatggcg 
agccatccgc 
caatatcttt 
cttgggcggc 
gaaagggaaa 
cccttaccat 
cacctcctca 
cccgaagaca 
atatgatacg 
tgcactaacc 
gtccgtcagg 
aaacggtgac 
tatccgccaa 
atacaaacaa 
gcattccggg 
tacatctttt 
ttcagacgat 
cggcaaaatc 
caagccggat 
gaaaggcagt 
ggaagtgaaa 
ccaccaccac 



cggcaggttc 
aggggggaac 
ttgggcaacc 
ttttccgatc 
tctgatgaag 
tacgaacacc 
aaaggcgcga 
aacctgaccg 
agtatgctga 
ctggacagat 
atcatcggcg 
ggctcaaaca 
cgcatcaacg 
gattgggcag 
atggcagcca 
atcacggcac 
tccgccgtca 
tcccgaaata 
accgtgccgc 
ggcgtaccgt 
ggatccggag 
gcaccgctcg 
aaaaacgaga 
agcctcaata 
atcgaagtgg 
agccattccg 
aagatggttg 
gacaagcttc 
gccggcggaa 
gaacatttga 
ggaaaacgcc 
tactccctcg 
accgtaaacg 
caccactga 



tcgaccgtca 
ttgccgagcg 
tgatgattca 
acgggcacga 
ccggtagtcc 
atcccgccga 
gggatatata 
acaaccgcag 
cgcaaggagt 
cgggcaatgc 
cggcaggaga 
ttgctgtcat 
atttggcaga 
tccaaaaccc 
tccccatcaa 
atcctatcaa 
gcgacaattt 
tccgttcaaa 
cgtcaaacgg 
ttgacggtaa 
ggggtggtgt 
accataaaga 
aactgaagct 
cgggcaaatt 
acgggcagct 
ccttaaccgc 
cgaaacgcca 
ccgaaggcgg 
aactgaccta 
aatcgccaga 
atgccgtcat 
gtatctttgg 
gcatacgcca 



gcatttcgaa 
cagcggccat 
acaggcggcc 
agtccattcc 
cgttgacgga 
cggctatgac 
cagctacgac 
caccggacaa 
aggcgacgga 
cgccgaagcc 
aattgtcggc 
gcacggcttg 
tatggcgcaa 
caatgccgca 
agggattgga 
gcggtcgcag 
tgccgatgcg 
cttggagcag 
caaaaatgtc 
agggtttccg 
cgccgccgac 
caaaggtttg 
ggcggcacaa 
gaagaacgac 
cattaccttg 
ctttcagacc 
gttcagaatc 
cagggcgaca 
caccatagat 
actcaatgtc 
cagcggttcc 
cggaaaagcc 
tatcggcctt 



<210> 141 

<211> 672 

<212> PRT 

<213> Artificial Sequence 



<220> 
<223> 



ORF46. 1-741 



55 



<400> 141 

Met Ser Asp Leu Ala Asn Asp Ser Phe lie Arg 
1 5 10 

Gin His Phe Glu Pro Asp Gly Lys Tyr His Leu 
20 25 

Glu Leu Ala Glu Arg Ser Gly His lie Gly Leu 
35 " 40 

His Gin Leu Gly Asn Leu Met lie Gin Gin Ala 
50 55 

lie Gly Tyr lie val Arg Phe ser Asp His Gly 
65 70 75 

Pro Phe Asp Asn His Ala ser His ser Asp ser 
85 90 



Gin Val Leu Asp Arg 
15 

Phe Gly Ser Arg Gly 
30 

Gly Lys lie Gin Ser 
45 

Ala lie Lys Gly Asn 
60 

His Glu Val His ser 
80 

Asp Glu Ala Gly ser 
95 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2019 
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Pro val Asp 

His His Pro 
115 

Ala Pro Lys 
130 

Ala Gin Asn 
145 

Arg Leu Ala 

val Gly Asp 

Arg ser Gly 
195 

Lys Asn lie 
210 

val Gin Gly 

225 

Gly Leu Leu 

Asp Met Ala 

Ala val Gin 
275 

lie Phe Met 
290 

Lys Tyr Gly 
305 

Met Gly Ala 

Phe Ala Asp 

Asn lie Arg 
355 

Ser Ser Thr 
370 

Gin Arg His 
385 

Asn Phe Glu 
val Ala Ala 



Gly Phe Ser 
100 

Ala Asp Gly 

Gly Ala Arg 

lie Arg Leu 
150 

Asp Arg Phe 
165 

Gly Phe Lys 
180 

Asn Ala Ala 
lie Gly Ala 



lie ser Glu 
230 

Ser Thr Glu 
245 

Gin Leu Lys 
260 

Asn Pro Asn 



Leu Tyr 



Arg lie 
105 



Gly Pro 
Tyr Ser 
Asn Leu Thr Asp 



Tyr Asp 
120 

Asp lie 
135 



Leu Asp His 
435 

val Arg Lys 
450 



Ala Ala lie 

Leu Gly Gly 
310 

lie Ala Leu 
325 

Ala Ala Tyr 
340 

Ser Asn Leu 

val Pro Pro 

Pro Lys Thr 
390 

Lys His val 
405 

Asp lie Gly 
420 

Lys Asp Lys 
Asn Glu Lys 



His Asn 

Arg Ala 

Glu Ala 
200 

Ala Gly 
215 

Gly ser 

Asn Lys 

Asp Tyr 

Ala Ala 
280 

Pro lie 

295 

lie Thr 

Pro Lys 

Ala Lys 

Glu Gin 
360 

Ser Asn 
375 

Gly val 
Lys Tyr 
Ala Gly 



Gly Leu 
440 

Leu Lys 
455 



Ala Gly 
170 

Thr Arg 

185 

Phe Asn 

Glu lie 

Asn lie 

Met Ala 
250 

Ala Ala 

265 

Gin Gly 

Lys Gly 

Ala His 

Gly Lys 
330 

Tyr Pro 
345 

Arg Tyr 

Gly Lys 

Pro Phe 

Asp Thr 
410 

Leu Ala 

425 

Gin ser 
Leu Ala 



His Trp Asp 

Gin Gly Gly 
125 

Tyr Asp lie 
140 

Asn Arg Ser 
155 

Ser Met Leu 

Tyr Ser Pro 

Gly Thr Ala 
205 

val Gly Ala 
220 

Ala val Met 
235 

Arg lie Asn 
Ala Ala lie 

He Glu Ala 

285 

lie Gly Ala 
300 

Pro lie Lys 
315 

ser Ala val 

Ser Pro Tyr 

Gly Lys Glu 
365 

Asn Val Lys 
380 

Asp Gly Lys 
395 

Gly Ser Gly 
Asp Ala Leu 



Leu Thr Leu 
445 

Ala Gin Gly 
460 



Gly Tyr Glu 
110 

Gly Tyr Pro 

Lys Gly Val 

Thr Gly Gin 
160 

Thr Gin Gly 

175 

Glu Leu Asp 
190 

Asp lie val 

Gly Asp Ala 

His Gly Leu 
240 

Asp Leu Ala 
255 

Arg Asp Trp 
270 

val ser Asn 

val Arg Gly 

Arg Ser Gin 
320 

Ser Asp Asn 
335 

Hi s Ser Arg 
350 

Asn lie Thr 

Leu Ala Asp 

Gly Phe Pro 
400 

Gly Gly Gly 
415 

Thr Ala Pro 

430 

Asp Gin ser 
Ala Glu Lys 
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Thr Tyr Gly Asn Gly Asp Ser Leu Asn Thr Gly Lys Leu Lys Asn Asp 
465 470 475 ' 480 

Lys Val Ser Arg Phe Asp Phe lie Arg Gin lie Glu Val Asp Gly Gin 
485 490 495 

Leu He Thr Leu Glu Ser Gly Glu Phe Gin Val Tyr Lys Gin Ser His 
500 505 510 

Ser Ala Leu Thr Ala Phe Gin Thr Glu Gin lie Gin Asp Ser Glu His 
515 520 525 

Ser Gly Lys Met Val Ala Lys Arg Gin Phe Arg lie Gly Asp He Ala 
530 535 540 

Gly Glu His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr 
545 550 555 560 

Tyr Arg Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr 
565 570 575 

Tyr Thr lie Asp Phe Ala Ala Lys Gin Gly Asn Gly Lys He Glu His 
580 585 590 

Leu Lys Ser Pro Glu Leu Asn val Asp Leu Ala Ala Ala Asp lie Lys 
595 600 605 

Pro Asp Gly Lys Arg His Ala Val lie Ser Gly Ser val Leu Tyr Asn 
610 615 620 

Gin Ala Glu Lys Gly Ser Tyr Ser Leu Gly lie Phe Gly Gly Lys Ala 
625 630 635 640 

Gin Glu val Ala Gly ser Ala Glu val Lys Thr val Asn Gly lie Arg 
645 650 655 

His lie Gly Leu Ala Ala Lys Gin Leu Glu His His His His His His 
660 665 670 

<210> 142 
<211> 2421 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 0RF46. 1-961 
<400> 142 

atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 
cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 
atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 
attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 
cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 
tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 
gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 
ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 
cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 
ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 
ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 
gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 
ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 
ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 
caaggcatag aagccgtcag caatatcttt atggcagcca tccccatcaa agggattgga 
gctgttcggg gaaaatacgg cttgggcggc atcacggcac atcctatcaa gcggtcgcag 
atgggcgcga tcgcattgcc gaaagggaaa tccgccgtca gcgacaattt tgccgatgcg 
gcatacgcca aatacccgtc cccttaccat tcccgaaata tccgttcaaa cttggagcag 
cgttacggca aagaaaacat cacctcctca accgtgccgc cgtcaaacgg caaaaatgtc 
aaactggcag accaacgcca cccgaagaca ggcgtaccgt ttgacggtaa agggtttccg 
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aattttgaga agcacgtgaa atatgatacg ggatccggag gaggaggagc cacaaacgac 1260 

gacgatgtta aaaaagctgc cactgtggcc attgctgctg cctacaacaa tggccaagaa 1320 

atcaacggtt tcaaagctgg agagaccatc tacgacattg atgaagacgg cacaattacc 1380 

aaaaaagacg caactgcagc cgatgttgaa gccgacgact ttaaaggtct gggtctgaaa 1440 

5 aaagtcgtga ctaacctgac caaaaccgtc aatgaaaaca aacaaaacgt cgatgccaaa 1500 

gtaaaagctg cagaatctga aatagaaaag ttaacaacca agttagcaga cactgatgcc 1560 

gctttagcag atactgatgc cgctctggat gcaaccacca acgccttgaa taaattggga 1620 

gaaaatataa cgacatttgc tgaagagact aagacaaata tcgtaaaaat tgatgaaaaa 1680 

ttagaagccg tggctgatac cgtcgacaag catgccgaag cattcaacga tatcgccgat 1740 

tcattggatg aaaccaacac taaggcagac gaagccgtca aaaccgccaa tgaagccaaa 1800 

10 cagacggccg aagaaaccaa acaaaacgtc gatgccaaag taaaagctgc agaaactgca 1860 

gcaggcaaag ccgaagctgc cgctggcaca gctaatactg cagccgacaa ggccgaagct 1920 

gtcgctgcaa aagttaccga catcaaagct gatatcgcta cgaacaaaga taatattgct 1980 

aaaaaagcaa acagtgccga cgtgtacacc agagaagagt ctgacagcaa atttgtcaga 2040 

attgatggtc tgaacgctac taccgaaaaa ttggacacac gcttggcttc tgctgaaaaa 2100 

tccattgccg atcacgatac tcgcctgaac ggtttggata aaacagtgtc agacctgcgc 2160 

75 aaagaaaccc gccaaggcct tgcagaacaa gccgcgctct ccggtctgtt ccaaccttac 2220 

aacgtgggtc ggttcaatgt aacggctgca gtcggcggct acaaatccga atcggcagtc 2280 

gccatcggta ccggcttccg ctttaccgaa aactttgccg ccaaagcagg cgtggcagtc 2340 

ggcacttcgt ccggttcttc cgcagcctac catgtcggcg tcaattacga gtggctcgag 2400 

caccaccacc accaccactg a 2421 

pn <210> 143 

<211> 806 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> 0RF46. 1-961 

25 

<400> 143 

Met Ser Asp Leu Ala Asn Asp Ser Phe lie Arg Gin val Leu Asp Arg 
1 5 10 15 

Gin His Phe Glu Pro Asp Gly Lys Tyr His Leu Phe Gly ser Arg Gly 

so 20 25 30 

Glu Leu Ala Glu Arg Ser Gly His lie Gly Leu Gly Lys lie Gin Ser 
35 40 45 

His Gin Leu Gly Asn Leu Met He Gin Gin Ala Ala He Lys Gly Asn 

35 50 55 60 

lie Gly Tyr lie Val Arg Phe Ser Asp His Gly His Glu Val His Ser 
65 70 75 80 

Pro Phe Asp Asn His Ala Ser His Ser Asp Ser Asp Glu Ala Gly Ser 
85 90 95 

40 

pro val Asp Gly Phe ser Leu Tyr Arg lie His Trp Asp Gly Tyr Glu 
100 105 110 

His His Pro Ala Asp Gly Tyr Asp Gly Pro Gin Gly Gly Gly Tyr Pro 
115 120 12 5 

45 

Ala Pro Lys Gly Ala Arg Asp lie Tyr Ser Tyr Asp lie Lys Gly Val 
130 135 140 

Ala Gin Asn lie Arg Leu Asn Leu Thr Asp Asn Arg Ser Thr Gly Gin 
145 150 155 160 

50 

Arg Leu Ala Asp Arg Phe His Asn Ala Gly Ser Met Leu Thr Gin Gly 
165 170 175 

Val Gly Asp Gly Phe Lys Arg Ala Thr Arg Tyr ser Pro Glu Leu Asp 
180 185 190 

55 Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala Asp He Val 

195 200 205 
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Lys Asn lie lie Gly Ala Ala Gly Glu lie Val Gly Ala Gly Asp Ala 
210 215 220 

val Gin Gly lie Ser Glu Gly Ser Asn He Ala Val Met His Gly Leu 
225 230 235 240 

Gly Leu Leu ser Thr Glu Asn Lys Met Ala Arg lie Asn Asp Leu Ala 
245 250 255 

Asp Met Ala Gin Leu Lys Asp Tyr Ala Ala Ala Ala lie Arg Asp Trp 
260 265 270 

Ala Val Gin Asn Pro Asn Ala Ala Gin Gly lie Glu Ala val Ser Asn 
275 280 285 

lie Phe Met Ala Ala lie Pro lie Lys Gly lie Gly Ala val Arg Gly 
290 295 300 

Lys Tyr Gly Leu Gly Gly lie Thr Ala His Pro He Lys Arg Ser Gin 
305 310 315 ~ 320 

Met Gly Ala lie Ala Leu Pro Lys Gly Lys Ser Ala val Ser Asp Asn 
325 330 335 

Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His Ser Arg 
340 345 350 

Asn lie Arg Ser Asn Leu Glu Gin Arg Tyr Gly Lys Glu Asn lie Thr 
355 360 365 

Ser Ser Thr val Pro Pro Ser Asn Gly Lys Asn val Lys Leu Ala Asp 
370 375 380 

Gin Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro 
385 390 395 400 

Asn Phe Glu Lys His Val Lys Tyr Asp Thr Gly Ser Gly Gly Gly Gly 
405 410 415 

Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr val Ala lie Ala 
420 425 430 

Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly Glu 
435 440 " 445 

Thr lie Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp Ala 
450 455 460 

Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu Lys 
465 470 475 480 

Lys val Val Thr Asn Leu Thr Lys Thr val Asn Glu Asn Lys Gin Asn 
485 490 495 

val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu lie Glu Lys Leu Thr 
500 505 510 

Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala Ala 
515 520 525 

Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie Thr 
530 535 540 

Thr Phe Ala Glu Glu Thr Lys Thr Asn lie val Lys lie Asp Glu Lys 
545 550 555 560 

Leu Glu Ala val Ala Asp Thr val Asp Lys His Ala Glu Ala Phe Asn 

565 570 575 
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Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu Ala 
580 585 590 

val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys Gin 
595 600 605 

Asn Val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys Ala 
610 615 620 

Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu Ala 
625 630 635 640 

Val Ala Ala Lys val Thr Asp lie Lys Ala Asp lie Ala Thr Asn Lys 
645 650 655 

Asp Asn lie Ala Lys Lys Ala Asn Ser Ala Asp val Tyr Thr Arg Glu 
660 665 670 

Glu Ser Asp Ser Lys Phe Val Arg lie Asp Gly Leu Asn Ala Thr Thr 
675 680 685 

Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser lie Ala Asp 
690 695 700 

His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr val ser Asp Leu Arg 
705 710 715 720 

Lys Glu Thr Arg Gin Gly Leu Ala Glu Gin Ala Ala Leu Ser Gly Leu 
725 730 735 

Phe Gin Pro Tyr Asn val Gly Arg Phe Asn val Thr Ala Ala val Gly 
740 745 750 

Gly Tyr Lys Ser Glu Ser Ala Val Ala lie Gly Thr Gly Phe Arg Phe 
755 760 765 

Thr Glu Asn Phe Ala Ala Lys Ala Gly Val Ala val Gly Thr Ser Ser 
770 775 780 

Gly ser Ser Ala Ala Tyr His val Gly val Asn Tyr Glu Trp Leu Glu 
785 790 795 800 

His His His His His His 
805 

<210> 144 
<211> 2256 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 0RF46.1-961C 
<400> 144 

atgtcagatt tggcaaacga ttcttttatc cggcaggttc tcgaccgtca gcatttcgaa 
cccgacggga aataccacct attcggcagc aggggggaac ttgccgagcg cagcggccat 
atcggattgg gaaaaataca aagccatcag ttgggcaacc tgatgattca acaggcggcc 
attaaaggaa atatcggcta cattgtccgc ttttccgatc acgggcacga agtccattcc 
cccttcgaca accatgcctc acattccgat tctgatgaag ccggtagtcc cgttgacgga 
tttagccttt accgcatcca ttgggacgga tacgaacacc atcccgccga cggctatgac 
gggccacagg gcggcggcta tcccgctccc aaaggcgcga gggatatata cagctacgac 
ataaaaggcg ttgcccaaaa tatccgcctc aacctgaccg acaaccgcag caccggacaa 
cggcttgccg accgtttcca caatgccggt agtatgctga cgcaaggagt aggcgacgga 
ttcaaacgcg ccacccgata cagccccgag ctggacagat cgggcaatgc cgccgaagcc 
ttcaacggca ctgcagatat cgttaaaaac atcatcggcg cggcaggaga aattgtcggc 
gcaggcgatg ccgtgcaggg cataagcgaa ggctcaaaca ttgctgtcat gcacggcttg 
ggtctgcttt ccaccgaaaa caagatggcg cgcatcaacg atttggcaga tatggcgcaa 
ctcaaagact atgccgcagc agccatccgc gattgggcag tccaaaaccc caatgccgca 
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Arg ser 

Lys Asn 
210 

Val Gin 
225 

Gly Leu 

Asp Met 

Ala val 

lie Phe 
290 

Lys Tyr 
305 

Met Gly 

Phe Ala 

Asn lie 

Ser ser 
370 

Gin Arg 
385 

Asn Phe 
Ala Thr 
Ala Ala 

Thr lie 

450 

Thr Ala 
465 

Lys val 
Val Asp 
Thr Lys 



Leu Asp 
530 

Thr Phe 
545 



Gly Asn 
195 

lie lie 

Gly lie 

Leu Ser 

Ala Gin 
260 

Gin Asn 
275 

Met Ala 
Gly Leu 
Ala He 



Asp Ala 
340 

Arg Ser 
355 



Ala Ala 

Gly Ala 

Ser Glu 
230 

Thr Glu 

245 

Leu Lys 

Pro Asn 

Ala He 

Gly Gly 
310 

Ala Leu 
325 

Ala Tyr 
Asn Leu 



Thr val Pro Pro 
Hi s Pro 
Glu Lys 



Lys Thr 
390 



Asn Asp 
420 

Tyr Asn 

435 

Tyr Asp 

Ala Asp 

val Thr 

Ala Lys 
500 

Leu Ala 

515 

Ala Thr 
Ala Glu 



His val 
405 

Asp Asp 

Asn Gly 

lie Asp 

val Glu 
470 

Asn Leu 
485 

val Lys 
Asp Thr 
Thr Asn 



Glu Thr 
550 



Glu Ala Phe 
200 

Ala Gly Glu 
215 

Gly Ser Asn 

Asn Lys Met 

Asp Tyr Ala 
265 

Ala Ala Gin 
280 

Pro lie Lys 
295 

lie Thr Ala 

Pro Lys Gly 

Ala Lys Tyr 
345 

Glu Gin Arg 
360 

Ser Asn Gly 
375 

Gly Val Pro 

Lys Tyr Asp 

val Lys Lys 
425 

Gin Glu lie 
440 

Glu Asp Gly 
455 

Ala Asp Asp 

Thr Lys Thr 



Ala Ala Glu 
505 

Asp Ala Ala 
520 

Ala Leu Asn 
535 

Lys Thr Asn 



Asn Gly 
lie val 

lie Ala 

235 

Ala Arg 
250 

Ala Ala 

Gly lie 

Gly lie 

His Pro 
315 

Lys Ser 
330 

Pro Ser 

Tyr Gly 

Lys Asn 

Phe Asp 
395 

Thr Gly 
410 

Ala Ala 
Asn Gly 
Thr lie 

Phe Lys 

475 

val Asn 
490 

Ser Glu 
Leu Ala 
Lys Leu 



He val 

555 



Thr Ala 
205 

Gly Ala 
220 

val Met 
lie Asn 
Ala lie 

Glu Ala 

285 

Gly Ala 
300 

lie Lys 

Ala Val 

Pro Tyr 

Lys Glu 
365 

val Lys 
380 

Gly Lys 

ser Gly 

Thr val 

Phe Lys 
445 

Thr Lys 
460 

Gly Leu 
Glu Asn 
lie Glu 



Asp Thr 
525 

Gly Glu 
540 

Lys lie 



Asp He Val 

Gly Asp Ala 

His Gly Leu 
240 

Asp Leu Ala 
255 

Arg Asp Trp 
270 

val Ser Asn 

val Arg Gly 

Arg Ser Gin 
320 

Ser Asp Asn 
335 

His Ser Arg 
350 

Asn lie Thr 

Leu Ala Asp 

Gly Phe Pro 
400 

Gly Gly Gly 
415 

Ala lie Ala 
430 

Ala Gly Glu 

Lys Asp Ala 

Gly Leu Lys 
480 

Lys Gin Asn 
495 

Lys Leu Thr 
510 

Asp Ala Ala 
Asn lie Thr 



Asp Glu Lys 
560 
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Leu Glu Ala 

Asp lie Ala 

Val Lys Thr 
595 

Asn val Asp 
610 

Glu Ala Ala 

625 

val Ala Ala 

Asp Asn lie 

Glu ser Asp 
675 

Glu Lys Leu 
690 

His Asp Thr 
705 

Lys Glu Thr 
Phe Gin Pro 



val Ala 
565 

Asp ser 
580 

Ala Asn 

Ala Lys 

Ala Gly 

Lys val 
645 

Ala Lys 
660 

Ser Lys 
Asp Thr 
Arg Leu 



Arg Gin 
725 

Tyr Asn 
740 



Asp Thr val 

Leu Asp Glu 

Glu Ala Lys 
600 

val Lys Ala 
615 

Thr Ala Asn 
630 

Thr Asp lie 

Lys Ala Asn 

Phe val Arg 
680 

Arg Leu Ala 
695 

Asn Gly Leu 
710 

Gly Leu Ala 
val Gly Leu 



Asp Lys 
570 

Thr Asn 
585 

Gin Thr 

Ala Glu 

Thr Ala 

Lys Ala 
650 

Ser Ala 
665 

lie Asp 
Ser Ala 
Asp Lys 



Glu Gin 
730 

Glu His 
745 



His Ala Glu 

Thr Lys Ala 

Ala Glu Glu 
605 

Thr Ala Ala 
620 

Ala Asp Lys 
635 

Asp lie Ala 
Asp val Tyr 



Gly Leu Asn 
685 

Glu Lys Ser 
700 

Thr Val Ser 
715 

Ala Ala Leu 



His His His 



Ala Phe Asn 
575 

Asp Glu Ala 
590 

Thr Lys Gin 

Gly Lys Ala 

Ala Glu Ala 
640 

Thr Asn Lys 
655 

Thr Arg Glu 
670 

Ala Thr Thr 

lie Ala Asp 



Asp Leu Arg 
720 

Ser Gly Leu 
735 

His His 
750 



<210> 146 

<211> 2421 

<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



961-ORF46.1 



<400> 146 
atggccacaa acgacgacga 
aacaatggcc aagaaatcaa 
gacggcacaa ttaccaaaaa 
ggtctgggtc tgaaaaaagt 
aacgtcgatg ccaaagtaaa 
gcagacactg atgccgcttt 
ttgaataaat tgggagaaaa 
aaaattgatg aaaaattaga 
aacgatatcg ccgattcatt 
gccaatgaag ccaaacagac 
gctgcagaaa ctgcagcagg 
gacaaggccg aagctgtcgc 
aaagataata ttgctaaaaa 
agcaaatttg tcagaattga 
gcttctgctg aaaaatccat 
gtgtcagacc tgcgcaaaga 
ctgttccaac cttacaacgt 
tccgaatcgg cagtcgccat 
gcaggcgtgg cagtcggcac 
tacgagtggg gatccggagg 
gttctcgacc gtcagcattt 
gaacttgccg agcgcagcgg 
aacctgatga ttcaacaggc 



tgttaaaaaa 
cggtttcaaa 
agacgcaact 
cgtgactaac 
agctgcagaa 
agcagatact 
tataacgaca 
agccgtggct 
ggatgaaacc 
ggccgaagaa 
caaagccgaa 
tgcaaaagtt 
agcaaacagt 
tggtctgaac 
tgccgatcac 
aacccgccaa 
gggtcggttc 
cggtaccggc 
ttcgtccggt 
aggaggatca 
cgaacccgac 
ccatatcgga 
ggccattaaa 



gctgccactg 
gctggagaga 
gcagccgatg 
ctgaccaaaa 
tctgaaatag 
gatgccgctc 
tttgctgaag 
gataccgtcg 
aacactaagg 
accaaacaaa 
gctgccgctg 
accgacatca 
gccgacgtgt 
gctactaccg 
gatactcgcc 
ggccttgcag 
aatgtaacgg 
ttccgcttta 
tcttccgcag 
gatttggcaa 
gggaaatacc 
ttgggaaaaa 
ggaaatatcg 



tggccattgc 
ccatctacga 
ttgaagccga 
ccgtcaatga 
aaaagttaac 
tggatgcaac 
agactaagac 
acaagcatgc 
cagacgaagc 
acgtcgatgc 
gcacagctaa 
aagctgatat 
acaccagaga 
aaaaattgga 
tgaacggttt 
aacaagccgc 
ctgcagtcgg 
ccgaaaactt 
cctaccatgt 
acgattcttt 
acctattcgg 
tacaaagcca 
gctacattgt 



tgctgcctac 
cattgatgaa 
cgactttaaa 
aaacaaacaa 
aaccaagtta 
caccaacgcc 
aaatatcgta 
cgaagcattc 
cgtcaaaacc 
caaagtaaaa 
tactgcagcc 
cgctacgaac 
agagtctgac 
cacacgcttg 
ggataaaaca 
gctctccggt 
cggctacaaa 
tgccgccaaa 
cggcgtcaat 
tatccggcag 
cagcaggggg 
tcagttgggc 
ccgcttttcc 
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gatcacgggc acgaagtcca ttcccccttc gacaaccatg cctcacattc cgattctgat 

gaagccggta gtcccgttga cggatttagc ctttaccgca tccattggga cggatacgaa 

caccatcccg ccgacggcta tgacgggcca cagggcggcg gctatcccgc tcccaaaggc 

gcgagggata tatacagcta cgacataaaa ggcgttgccc aaaatatccg cctcaacctg 

accgacaacc gcagcaccgg acaacggctt gccgaccgtt tccacaatgc cggtagtatg 

ctgacgcaag gagtaggcga cggattcaaa cgcgccaccc gatacagccc cgagctggac 

agatcgggca atgccgccga agccttcaac ggcactgcag atatcgttaa aaacatcatc 

ggcgcggcag gagaaattgt cggcgcaggc gatgccgtgc agggcataag cgaaggctca 

aacattgctg tcatgcacgg cttgggtctg ctttccaccg aaaacaagat ggcgcgcatc 

aacgatttgg cagatatggc gcaactcaaa gactatgccg cagcagccat ccgcgattgg 

gcagtccaaa accccaatgc cgcacaaggc atagaagccg tcagcaatat ctttatggca 

gccatcccca tcaaagggat tggagctgtt cggggaaaat acggcttggg cggcatcacg 

gcacatccta tcaagcggtc gcagatgggc gcgatcgcat tgccgaaagg gaaatccgcc 

gtcagcgaca attttgccga tgcggcatac gccaaatacc cgtcccctta ccattcccga 

aatatccgtt caaacttgga gcagcgttac ggcaaagaaa acatcacctc ctcaaccgtg 

ccgccgtcaa acggcaaaaa tgtcaaactg gcagaccaac gccacccgaa gacaggcgta 

ccgtttgacg gtaaagggtt tccgaatttt gagaagcacg tgaaatatga tacgctcgag 
caccaccacc accaccactg a 

<210> 147 

<211> 806 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 961-ORF46.1 

<400> 147 

Met Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr Val Ala lie 
1 5 10 15 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly 
20 25 30 

Glu Thr lie Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp 
35 40 45 

Ala Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 
50 55 60 

Lys Lys Val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gin 
65 70 75 80 

Asn val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu lie Glu Lys Leu 
85 90 95 

Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 
100 105 110 

Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn He 
115 120 125 

Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn lie val Lys lie Asp Glu 
130 13 5 140 

Lys Leu Glu Ala val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 
145 150 155 160 

Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 
165 170 175 

Ala val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys 
180 185 190 

Gin Asn val Asp Ala Lys val Lys Ala Ala Glu Thr Ala Ala Gly Lys 
195 200 205 

Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 
210 215 220 
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Ala val Ala 
225 

Lys Asp Asn 

Glu Glu Ser 

Thr Glu Lys 
275 

Asp HIS Asp 

290 

Arg Lys Glu 
305 

Leu Phe Gin 

Gly Gly Tyr 

Phe Thr Glu 
355 

Ser Gly Ser 
370 

Ser Gly Gly 
385 

Val Leu Asp 

Gly Ser Arg 

Lys lie Gin 
435 

lie Lys Gly 
450 

Glu Val His 
465 

Glu Ala Gly 

Asp Gly Tyr 

Gly Gly Tyr 
515 

lie Lys Gly 
530 

Ser Thr Gly 
545 

Leu Thr Gin 
Pro Glu Leu 



Ala Lys 

lie Ala 
245 

Asp Ser 
260 

Leu Asp 

Thr Arg 

Thr Arg 

Pro Tyr 
325 

Lys Ser 
340 

Asn Phe 

Ser Ala 

Gly Gly 

Arg Gin 
405 

Gly Glu 
420 

Ser His 
Asn lie 
Ser Pro 



Ser Pro 
485 

Glu His 
500 



val Thr 
230 

Lys Lys 

Lys Phe 

Thr Arg 

Leu Asn 
295 

Gin Gly 
310 

Asn val 

Glu Ser 

Ala Ala 

Ala Tyr 
375 

Ser Asp 
390 

His Phe 
Leu Ala 
Gin Leu 



Gly Tyr 
455 

Phe Asp 
470 

val Asp 



His Pro 



Pro Ala Pro Lys 
Val Ala 
Gin Arg 



Gin Asn 
535 



Gly Val 
565 

Asp Arg 
580 



Leu Ala 
550 

Gly Asp 
ser Gly 



Asp lie Lys 

Ala Asn Ser 
250 

val Arg lie 
265 

Leu Ala Ser 
280 

Gly Leu Asp 

Leu Ala Glu 

Gly Arg Phe 
330 

Ala val Ala 
345 

Lys Ala Gly 
360 

His val Gly 

Leu Ala Asn 

Glu Pro Asp 
410 

Glu Arg Ser 
425 

Gly Asn Leu 
440 

lie Val Arg 

Asn His Ala 

Gly Phe Ser 
490 

Ala Asp Gly 
505 

Gly Ala Arg 
520 

lie Arg Leu 
Asp Arg Phe 



Gly Phe Lys 
570 

Asn Ala Ala 
585 



Ala Asp lie 
235 

Ala Asp Val 

Asp Gly Leu 

Ala Glu Lys 
285 

Lys Thr Val 
300 

Gin Ala Ala 
315 

Asn val Thr 

lie Gly Thr 

val Ala val 
365 

Val Asn Tyr 
380 

Asp Ser Phe 
395 

Gly Lys Tyr 

Gly His lie 

Met lie Gin 
445 

Phe Ser Asp 
460 

Ser His ser 
475 

Leu Tyr Arg 

Tyr Asp Gly 

Asp lie Tyr 
525 

Asn Leu Thr 
540 

His Asn Ala 
555 

Arg Ala Thr 
Glu Ala Phe 



Ala Thr Asn 
240 

Tyr Thr Arg 
255 

Asn Ala Thr 
270 

Ser lie Ala 

Ser Asp Leu 

Leu Ser Gly 
320 

Ala Ala Val 
335 

Gly Phe Arg 
350 

Gly Thr Ser 

Glu Trp Gly 

lie Arg Gin 
400 

His Leu Phe 
415 

Gly Leu Gly 
430 

Gin Ala Ala 

His Gly His 

Asp Ser Asp 
480 

lie His Trp 

495 

Pro Gin Gly 
510 

Ser Tyr Asp 
Asp Asn Arg 



Gly Ser Met 
560 

Arg Tyr Ser 
575 

Asn Gly Thr 
590 
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Ala Asp lie Val Lys Asn He lie Gly Ala Ala Gly Glu He Val Gly 
595 600 605 

Ala Gly Asp Ala Val Gin Gly lie Ser Glu Gly Ser Asn lie Ala Val 
610 615 620 

Met His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg lie 
625 630 635 640 

Asn Asp Leu Ala Asp Met Ala Gin Leu Lys Asp Tyr Ala Ala Ala Ala 
645 650 655 

lie Arg Asp Trp Ala Val Gin Asn Pro Asn Ala Ala Gin Gly lie Glu 
660 665 670 

Ala val Ser Asn lie Phe Met Ala Ala lie Pro He Lys Gly He Gly 
15 675 680 685 

Ala val Arg Gly Lys Tyr Gly Leu Gly Gly lie Thr Ala His Pro lie 
690 695 700 

Lys Arg ser Gin Met Gly Ala lie Ala Leu Pro Lys Gly Lys Ser Ala 

20 705 710 715 720 

val Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro 
725 730 735 

Tyr His Ser Arg Asn lie Arg Ser Asn Leu Glu Gin Arg Tyr Gly Lys 
25 740 745 750 

Glu Asn lie Thr Ser Ser Thr val Pro Pro Ser Asn Gly Lys Asn val 
755 760 765 



30 



35 



Lys Leu Ala Asp Gin Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly 
770 775 780 

Lys Gly Phe Pro Asn Phe Glu Lys His val Lys Tyr Asp Thr Leu Glu 
785 790 795 800 

His His His His His His 
805 

<210> 148 
<211> 1938 

<212> DNA 

<213> Artificial Sequence 

40 <220> 

<223> 961-741 

<400> 148 

atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 

aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 

45 gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 

ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 

aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 

gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 

ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 

aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 

aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 

gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 

gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 

gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 

aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 

agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 

gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 

gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 

ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020 



50 



55 
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tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 
gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 
tacgagtggg gatccggagg gggtggtgtc gccgccgaca tcggtgcggg gcttgccgat 
gcactaaccg caccgctcga ccataaagac aaaggtttgc agtctttgac gctggatcag 
tccgtcagga aaaacgagaa actgaagctg gcggcacaag gtgcggaaaa aacttatgga 
aacggtgaca gcctcaatac gggcaaattg aagaacgaca aggtcagccg tttcgacttt 
atccgccaaa tcgaagtgga cgggcagctc attaccttgg agagtggaga gttccaagta 
tacaaacaaa gccattccgc cttaaccgcc tttcagaccg agcaaataca agattcggag 
cattccggga agatggttgc gaaacgccag ttcagaatcg gcgacatagc gggcgaacat 
acatcttttg acaagcttcc cgaaggcggc agggcgacat atcgcgggac ggcgttcggt 
tcagacgatg ccggcggaaa actgacctac accatagatt tcgccgccaa gcagggaaac 
ggcaaaatcg aacatttgaa atcgccagaa ctcaatgtcg acctggccgc cgccgatatc 
aagccggatg gaaaacgcca tgccgtcatc agcggttccg tcctttacaa ccaagccgag 
aaaggcagtt actccctcgg tatctttggc ggaaaagccc aggaagttgc cggcagcgcg 
gaagtgaaaa ccgtaaacgg catacgccat atcggccttg ccgccaagca actcgagcac 
caccaccacc accactga 

<210> 149 

<211> 645 

<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 961-741 

<400> 149 

Met Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr val Ala lie 
1 5 10 15 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly 
20 25 30 

Glu Thr lie Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp 

35 40 45 

Ala Thr Ala Ala Asp Val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 
50 55 60 

Lys Lys val Val Thr Asn Leu Thr Lys Thr val Asn Glu Asn Lys Gin 
65 70 75 80 

Asn Val Asp Ala Lys Val Lys Ala Ala Glu Ser Glu lie Glu Lys Leu 
85 90 95 

Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 
100 105 110 

Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie 
115 120 125 

Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn lie Val Lys lie Asp Glu 
130 135 140 

Lys Leu Glu Ala Val Ala Asp Thr Val Asp Lys His Ala Glu Ala Phe 
145 150 155 160 

Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 
165 170 175 

Ala val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys 
180 185 190 

Gin Asn val Asp Ala Lys val Lys Ala Ala Glu Thr Ala Ala Gly Lys 
195 200 205 

Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 
210 215 220 

Ala val Ala Ala Lys val Thr Asp lie Lys Ala Asp lie Ala Thr Asn 
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225 

Lys Asp Asn 

Glu Glu ser 

Thr Glu Lys 
275 

Asp His ASP 

290 

Arg Lys Glu 
305 

Leu Phe Gin 

Gly Gly Tyr 

Phe Thr Glu 
355 

Ser Gly Ser 
370 

Ser Gly Gly 
385 

Ala Leu Thr 

Thr Leu Asp 

Gin Gly Ala 
435 

Lys Leu Lys 
450 

Glu val Asp 
465 

Tyr Lys Gin 
Gin Asp ser 



lie Gly Asp 
515 

Gly Gly Arg 
530 

Gly Gly Lys 
545 



lie Ala 
245 

Asp Ser 
260 

Leu Asp 
Thr Arg 
Thr Arg 

Pro Tyr 

325 

Lys Ser 
340 

Asn Phe 

Ser Ala 

Gly Gly 

Ala Pro 
405 

Gin Ser 
420 

Glu Lys 

Asn Asp 

Gly Gin 

Ser His 
485 

Glu His 
500 

lie Ala 
Ala Thr 
Leu Thr 



Glu His 
565 



Gly Lys lie 
Ala Ala Asp 
Ser val Leu Tyr Asn 



He Lys 
580 



230 

Lys Lys 

Lys Phe 

Thr Arg 

Leu Asn 
295 

Gin Gly 
310 

Asn Val 

Glu Ser 

Ala Ala 

Ala Tyr 
375 

val Ala 
390 

Leu Asp 

val Arg 

Thr Tyr 

Lys val 
455 

Leu lie 
470 

Ser Ala 

Ser Gly 

Gly Glu 

Tyr Arg 
535 

Tyr Thr 
550 

Leu Lys 
Pro Asp 
Gin Ala 



Ala Asn ser 
250 

Val Arg lie 
265 

Leu Ala Ser 
280 

Gly Leu Asp 
Leu Ala Glu 

Gly Arg Phe 

330 

Ala val Ala 
345 

Lys Ala Gly 
360 

His Val Gly 

Ala Asp lie 

His Lys Asp 
410 

Lys Asn Glu 
425 

Gly Asn Gly 
440 

Ser Arg Phe 

Thr Leu Glu 

Leu Thr Ala 
490 

Lys wet val 
505 

His Thr Ser 
520 

Gly Thr Ala 
lie Asp Phe 



235 

Ala Asp val 

Asp Gly Leu 

Ala Glu Lys 
285 

Lys Thr val 
300 

Gin Ala Ala 
315 

Asn Val Thr 
He Gly Thr 

val Ala val 

365 

val Asn Tyr 
380 

Gly Ala Gly 
395 

Lys Gly Leu 

Lys Leu Lys 

Asp Ser Leu 
445 

Asp Phe lie 
460 

Ser Gly Glu 
475 

Phe Gin Thr 
Ala Lys Arg 



Phe Asp Lys 
525 

Phe Gly Ser 
540 

Ala Ala Lys 
555 



240 

Tyr Thr Arg 
255 

Asn Ala Thr 
270 

Ser lie Ala 

ser Asp Leu 

Leu Ser Gly 
320 

Ala Ala Val 
335 

Gly Phe Arg 
350 

Gly Thr Ser 

Glu Trp Gly 

Leu Ala Asp 
400 

Gin ser Leu 

415 

Leu Ala Ala 
430 

Asn Thr Gly 

Arg Gin lie 

Phe Gin val 
480 

Glu Gin lie 

495 

Gin Phe Arg 
510 

Leu Pro Glu 
Asp Asp Ala 



Gin Gly Asn 
560 



Leu Asn val 
His Ala Val 
Glu Lys Gly Ser Tyr Ser Leu Gly He 



Ser Pro Glu 
570 

Gly Lys Arg 
585 



Asp Leu Ala 
575 

lie Ser Gly 
590 
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595 



600 



605 



Phe Gly Gly Lys Ala Gin Glu val Ala Gly Ser Ala Glu Val Lys Thr 

610 615 620 

Val Asn Gly lie Arg His lie Gly Leu Ala Ala Lys Gin Leu Glu His 

625 630 635 640 



10 



15 



His His His His His 
645 

<210> 150 
<211> 4335 
<212> DNA 

<213> Artificial Sequence 



<220> 
<223> 



961-983 



20 



25 



30 



35 



40 



45 



50 



55 



<400> 150 

atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 

aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 

gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 

ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 

aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 

gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 

ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 

aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 

aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 

gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 

gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 

gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 

aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 

agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 

gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 

gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 

ctgttccaac cttacaacgt gggtcggttc aatgtaacgg ctgcagtcgg cggctacaaa 1020 

tccgaatcgg cagtcgccat cggtaccggc ttccgcttta ccgaaaactt tgccgccaaa 1080 

gcaggcgtgg cagtcggcac ttcgtccggt tcttccgcag cctaccatgt cggcgtcaat 1140 

tacgagtggg gatccggcgg aggcggcact tctgcgcccg acttcaatgc aggcggtacc 1200 

ggtatcggca gcaacagcag agcaacaaca gcgaaatcag cagcagtatc ttacgccggt 1260 

atcaagaacg aaatgtgcaa agacagaagc atgctctgtg ccggtcggga tgacgttgcg 1320 

gttacagaca gggatgccaa aatcaatgcc ccccccccga atctgcatac cggagacttt 1380 

ccaaacccaa atgacgcata caagaatttg atcaacctca aacctgcaat tgaagcaggc 1440 

tatacaggac gcggggtaga ggtaggtatc gtcgacacag gcgaatccgt cggcagcata 1500 

tcctttcccg aactgtatgg cagaaaagaa cacggctata acgaaaatta caaaaactat 1560 

acggcgtata tgcggaagga agcgcctgaa gacggaggcg gtaaagacat tgaagcttct 1620 

ttcgacgatg aggccgttat agagactgaa gcaaagccga cggatatccg ccacgtaaaa 1680 

gaaatcggac acatcgattt ggtctcccat attattggcg ggcgttccgt ggacggcaga 1740 

cctgcaggcg gtattgcgcc cgatgcgacg ctacacataa tgaatacgaa tgatgaaacc 1800 

aagaacgaaa tgatggttgc agccatccgc aatgcatggg tcaagctggg cgaacgtggc 1860 

gtgcgcatcg tcaataacag ttttggaaca acatcgaggg caggcactgc cgaccttttc 1920 

caaatagcca attcggagga gcagtaccgc caagcgttgc tcgactattc cggcggtgat 1980 

aaaacagacg agggtatccg cctgatgcaa cagagcgatt acggcaacct gtcctaccac 2040 

atccgtaata aaaacatgct tttcatcttt tcgacaggca atgacgcaca agctcagccc 2100 

aacacatatg ccctattgcc attttatgaa aaagacgctc aaaaaggcat tatcacagtc 2160 

gcaggcgtag accgcagtgg agaaaagttc aaacgggaaa tgtatggaga accgggtaca 2220 

gaaccgcttg agtatggctc caaccattgc ggaattactg ccatgtggtg cctgtcggca 2280 

ccctatgaag caagcgtccg tttcacccgt acaaacccga ttcaaattgc cggaacatcc 2340 

ttttccgcac ccatcgtaac cggcacggcg gctctgctgc tgcagaaata cccgtggatg 2400 

agcaacgaca acctgcgtac cacgttgctg acgacggctc aggacatcgg tgcagtcggc 2460 

gtggacagca agttcggctg gggactgctg gatgcgggta aggccatgaa cggacccgcg 2520 

tcctttccgt tcggcgactt taccgccgat acgaaaggta catccgatat tgcctactcc 2580 

ttccgtaacg acatttcagg cacgggcggc ctgatcaaaa aaggcggcag ccaactgcaa 2640 

ctgcacggca acaacaccta tacgggcaaa accattatcg aaggcggttc gctggtgttg 2700 

tacggcaaca acaaatcgga tatgcgcgtc gaaaccaaag gtgcgctgat ttataacggg 2760 

gcggcatccg gcggcagcct gaacagcgac ggcattgtct atctggcaga taccgaccaa 2820 

tccggcgcaa acgaaaccgt acacatcaaa ggcagtctgc agctggacgg caaaggtacg 2880 

ctgtacacac gtttgggcaa actgctgaaa gtggacggta cggcgattat cggcggcaag 2940 
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ctgtacatgt cggcacgcgg caagggggca ggctatctca acagtaccgg acgacgtgtt 3000 

cccttcctga gtgccgccaa aatcgggcag gattattctt tcttcacaaa catcgaaacc 3060 

gacggcggcc tgctggcttc cctcgacagc gtcgaaaaaa cagcgggcag tgaaggcgac 3120 

acgctgtcct attatgtccg tcgcggcaat gcggcacgga ctgcttcggc agcggcacat 3180 

5 tccgcgcccg ccggtctgaa acacgccgta gaacagggcg gcagcaatct ggaaaacctg 3240 

atggtcgaac tggatgcctc cgaatcatcc gcaacacccg agacggttga aactgcggca 3300 

gccgaccgca cagatatgcc gggcatccgc ccctacggcg caactttccg cgcagcggca 3360 

gccgtacagc atgcgaatgc cgccgacggt gtacgcatct tcaacagtct cgccgctacc 3420 

gtctatgccg acagtaccgc cgcccatgcc gatatgcagg gacgccgcct gaaagccgta 3480 

tcggacgggt tggaccacaa cggcacgggt ctgcgcgtca tcgcgcaaac ccaacaggac 3540 

w ggtggaacgt gggaacaggg cggtgttgaa ggcaaaatgc gcggcagtac ccaaaccgtc 3600 

ggcattgccg cgaaaaccgg cgaaaatacg acagcagccg ccacactggg catgggacgc 3660 

agcacatgga gcgaaaacag tgcaaatgca aaaaccgaca gcattagtct gtttgcaggc 3720 

atacggcacg atgcgggcga tatcggctat ctcaaaggcc tgttctccta cggacgctac 3780 

aaaaacagca tcagccgcag caccggtgcg gacgaacatg cggaaggcag cgtcaacggc 3840 

acgctgatgc agctgggcgc actgggcggt gtcaacgttc cgtttgccgc aacgggagat 3900 

15 ttgacggtcg aaggcggtct gcgctacgac ctgctcaaac aggatgcatt cgccgaaaaa 3960 

ggcagtgctt tgggctggag cggcaacagc ctcactgaag gcacgctggt cggactcgcg 4020 

ggtctgaagc tgtcgcaacc cttgagcgat aaagccgtcc tgtttgcaac ggcgggcgtg 4080 

gaacgcgacc tgaacggacg cgactacacg gtaacgggcg gctttaccgg cgcgactgca 4140 

gcaaccggca agacgggggc acgcaatatg ccgcacaccc gtctggttgc cggcctgggc 4200 

gcggatgtcg aattcggcaa cggctggaac ggcttggcac gttacagcta cgccggttcc 4260 

aaacagtacg gcaaccacag cggacgagtc ggcgtaggct accggttcct cgagcaccac 4320 

caccaccacc actga " " 4335 

<210> 151 

<211> 1444 

<212> PRT 

<213> Artificial Sequence 

25 

<220> 

<223> 961-983 

<400> 151 

Met Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr val Ala lie 

so 1 5 10 15 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly 
20 25 30 



20 



35 



40 



45 



50 



55 



Glu Thr He Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp 
35 40 45 

Ala Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 
50 55 60 

Lys Lys val val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gin 
65 70 75 80 

Asn val Asp Ala Lys val Lys Ala Ala Glu ser Glu He Glu Lys Leu 
85 90 95 

Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 
100 105 110 

Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie 
115 120 125 

Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn lie Val Lys lie Asp Glu 
130 135 140 

Lys Leu Glu Ala val Ala Asp Thr val Asp Lys His Ala Glu Ala Phe 
145 150 155 160 

Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 
165 170 175 

Ala Val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys 
180 185 190 
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Gin Asn val Asp Ala Lys Val Lys Ala Ala Glu Thr Ala Ala Gly Lys 
195 200 205 

Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 
210 215 220 

Ala val Ala Ala Lys val Thr Asp lie Lys Ala Asp lie Ala Thr Asn 

225 230 235 240 

Lys Asp Asn lie Ala Lys Lys Ala Asn Ser Ala Asp val Tyr Thr Arg 
245 250 255 

Glu Glu Ser Asp Ser Lys Phe val Arg lie Asp Gly Leu Asn Ala Thr 
260 265 270 

Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser lie Ala 
275 280 285 

Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr val Ser Asp Leu 
290 295 300 

Arg Lys Glu Thr Arg Gin Gly Leu Ala Glu Gin Ala Ala Leu Ser Gly 
305 310 315 320 

Leu Phe Gin Pro Tyr Asn Val Gly Arg Phe Asn val Thr Ala Ala Val 
325 330 335 

Gly Gly Tyr Lys Ser Glu Ser Ala Val Ala lie Gly Thr Gly Phe Arg 
340 345 350 

Phe Thr Glu Asn Phe Ala Ala Lys Ala Gly val Ala Val Gly Thr Ser 
355 360 365 

Ser Gly Ser Ser Ala Ala Tyr His val Gly Val Asn Tyr Glu Trp Gly 
370 375 380 

Ser Gly Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr 
385 390 395 400 

Gly lie Gly Ser Asn Ser Arg Ala Thr Thr Ala Lys Ser Ala Ala Val 
405 410 415 

ser Tyr Ala Gly lie Lys Asn Glu Met cys Lys Asp Arg ser Met Leu 
420 425 430 

Cys Ala Gly Arg Asp Asp val Ala val Thr Asp Arg Asp Ala Lys lie 
435 440 445 

Asn Ala Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn 
450 455 460 

Asp Ala Tyr Lys Asn Leu lie Asn Leu Lys Pro Ala lie Glu Ala Gly 
465 470 475 480 

Tyr Thr Gly Arg Gly Val Glu Val Gly lie val Asp Thr Gly Glu Ser 
485 490 495 

Val Gly Ser lie Ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly 
500 505 510 

Tyr Asn Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala 
515 520 52 5 

Pro Glu Asp Gly Gly Gly Lys Asp lie Glu Ala Ser Phe Asp Asp Glu 
530 535 540 

Ala val lie Glu Thr Glu Ala Lys Pro Thr Asp lie Arg His val Lys 
545 550 555 560 
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Glu lie Gly 

val Asp Gly 

lie Met Asn 
595 

lie Arg Asn 
610 

Asn Asn ser 
625 

Gin lie Ala 

Ser Gly Gly 

Asp Tyr Gly 
675 

lie Phe Ser 
690 

Leu Leu pro 

705 

Ala Gly Val 

Glu Pro Gly 

Thr Ala Met 
755 

Thr Arg Thr 
770 

lie Val Thr 
785 

ser Asn Asp 

Gly Ala Val 

Gly Lys Ala 
835 

Ala Asp Thr 
850 

lie Ser Gly 
865 

Leu His Gly 
ser Leu val 



Lys Gly Ala 
915 



His lie Asp 
565 

Arg Pro Ala 
580 

Thr Asn Asp 

Ala Trp val 

Phe Gly Thr 
630 

Asn ser Glu 
645 

Asp Lys Thr 
660 

Asn Leu Ser 

Thr Gly Asn 

Phe Tyr Glu 
710 

Asp Arg Ser 
725 

Thr Glu Pro 
740 

Trp cys Leu 

Asn Pro lie 

Gly Thr Ala 
790 

Asn Leu Arg 
805 

Gly Val Asp 
820 

Met Asn Gly 
Lys Gly Thr 



Thr Gly Gly 
870 

Asn Asn Thr 
885 

Leu Tyr Gly 
900 

Leu lie Tyr 



Leu Val 

Gly Gly 

Glu Thr 
600 

Lys Leu 
615 

Thr ser 

Glu Gin 

Asp Glu 

Tyr His 
680 

Asp Ala 
695 

Lys Asp 
Gly Glu 
Leu Glu 



ser Ala 
760 

Gin He 

775 



Ser His 
570 

lie Ala 
585 

Lys Asn 

Gly Glu 

Arg Ala 

Tyr Arg 
650 

Gly lie 
665 

lie Arg 

Gin Ala 

Ala Gin 

Lys Phe 
730 

Tyr Gly 
745 

Pro Tyr 
Ala Gly 



lie lie Gly Gly 
pro Asp 
Glu Met 



Ala Thr 
590 



Arg Gly 
620 

Gly Thr 

635 



Met val 
605 

Val Arg 
Ala Asp 



Gin Ala Leu Leu 
Arg Leu 
Asn Lys 



Met Gin 
670 



Ala Leu Leu Leu 
Thr Thr 
Ser Lys 



Leu Leu 

810 



Pro Ala 
840 

ser Asp 
855 

Leu lie 
Tyr Thr 



Phe Gly 
82 5 

Ser Phe 



Gin Pro 
700 

Lys Gly 
715 

Lys Arg 

Ser Asn 

Glu Ala 

Thr Ser 
780 

Gin Lys 
795 

Thr Thr 
Trp Gly 
Pro Phe 



Asn Met 
685 

Asn Thr 
lie lie 
Glu Met 



Arg Ser 
575 

Leu His 
Ala Ala 
lie val 

Leu Phe 

640 

Asp Tyr 
655 

Gin Ser 
Leu Phe 
Tyr Ala 



Thr val 

720 

Tyr Gly 
735 



Gly lie 
Arg Phe 
Phe Ser Ala Pro 



His Cys 
750 

ser val 
765 



lie Ala Tyr Ser 
860 

Lys Lys Gly Gly 
875 

Gly Lys Thr lie 
890 



Asn Asn Lys Ser Asp Met 
905 



Asn Gly 
920 



Ala Ala Ser Gly 



Tyr Pro 

Ala Gin 

Leu Leu 
830 

Gly Asp 
845 

Phe Arg 
Ser Gin 
lie Glu 



Arg Val 
910 

Gly ser 
925 



Trp Met 
800 

Asp lie 
815 

Asp Ala 

Phe Thr 

Asn Asp 

Leu Gin 
880 

Gly Gly 
895 

Glu Thr 
Leu Asn 
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Ser Asp Gly lie val Tyr Leu Ala Asp Thr Asp Gin ser Gly Ala Asn 
930 935 940 

Glu Thr val His lie Lys Gly ser Leu Gin Leu Asp Gly Lys Gly Thr 
945 950 955 960 

Leu Tyr Thr Arg Leu Gly Lys Leu Leu Lys val Asp Gly Thr Ala lie 
965 970 975 

lie Gly Gly Lys Leu Tyr Met Ser Ala Arg Gly Lys Gly Ala Gly Tyr 
980 985 990 

Leu Asn Ser Thr Gly Arg Arg Val Pro Phe Leu Ser Ala Ala Lys lie 
995 1000 1005 

Gly Gin Asp Tyr ser Phe Phe Thr Asn lie Glu Thr Asp Gly Gly Leu 
1010 1015 1020 

Leu Ala ser Leu Asp ser val Glu Lys Thr Ala Gly ser Glu Gly Asp 
1025 1030 1035 1040 

Thr Leu Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser 
1045 1050 1055 

Ala Ala Ala His Ser Ala Pro Ala Gly Leu Lys His Ala val Glu Gin 
1060 1065 1070 

Gly Gly Ser Asn Leu Glu Asn Leu Met Val Glu Leu Asp Ala Ser Glu 
1075 1080 1085 

Ser ser Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr 
1090 1095 1100 

Asp Met Pro Gly lie Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala 
1105 1110 1115 ~ 1120 

Ala val Gin His Ala Asn Ala Ala Asp Gly Val Arg lie Phe Asn Ser 
1125 1130 1135 

Leu Ala Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met 
1140 1145 1150 

Gin Gly Arg Arg Leu Lys Ala val Ser Asp Gly Leu Asp His Asn Gly 
1155 1160 1165 

Thr Gly Leu Arg val lie Ala Gin Thr Gin Gin Asp Gly Gly Thr Trp 
1170 1175 1180 

Glu Gin Gly Gly Val Glu Gly Lys Met Arg Gly Ser Thr Gin Thr Val 
1185 1190 1195 1200 

Gly lie Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu 
1205 1210 1215 

Gly Met Gly Arg ser Thr Trp ser Glu Asn ser Ala Asn Ala Lys Thr 
1220 1225 1230 

Asp Ser lie Ser Leu Phe Ala Gly lie Arg His Asp Ala Gly Asp lie 
1235 1240 1245 

Gly Tyr Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser lie 
1250 ' 1255 1260 

Ser Arg ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser Val Asn Gly 
1265 1270 1275 1280 

Thr Leu Met Gin Leu Gly Ala Leu Gly Gly val Asn val Pro Phe Ala 
1285 1290 1295 
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aagatggcgc gcatcaacga tttggcagat atggcgcaac tcaaagacta tgccgcagca 1800 

gccatccgcg attgggcagt ccaaaacccc aatgccgcac aaggcataga agccgtcagc 1860 

aatatcttta tggcagccat ccccatcaaa gggattggag ctgttcgggg aaaatacggc 1920 

ttgggcggca tcacggcaca tcctatcaag cggtcgcaga tgggcgcgat cgcattgccg 1980 

aaagggaaat ccgccgtcag cgacaatttt gccgatgcgg catacgccaa atacccgtcc 2040 

ccttaccatt cccgaaatat ccgttcaaac ttggagcagc gttacggcaa agaaaacatc 2100 

acctcctcaa ccgtgccgcc gtcaaacggc aaaaatgtca aactggcaga ccaacgccac 2160 

ccgaagacag gcgtaccgtt tgacggtaaa gggtttccga attttgagaa gcacgtgaaa 2220 

tatgatacgc tcgagcacca ccaccaccac cactga ~ 2256 



10 


<210> 
<211> 
<212> 
<213> 


153 
751 
PRT 

Arti f i ci al Sequence 




















15 


<220> 
<223> 


961c-ORF46.1 
























<400> 
Met Ala 
1 


153 
Thr 


Asn 


Asp 

5 


Asp 


Asp 


val 


Lys 


Lys 
10 


Ala 


Ala 


Thr 


val 


Ala 
15 


He 


20 


Ala 


Ala 


Ala 


Tyr 
20 


Asn 


Asn 


Gly 


Gin 


Glu 
25 


He 


Asn 


Gly 


Phe 


Lys 
30 


Ala 


Gly 




Glu 


Thr 


lie 
35 


Tyr 


Asp 


He 


Asp 


Glu 
40 


Asp 


Gly 


Thr 


He 


Thr 
45 


Lys 


Lys 


Asp 


25 


Ala 


Thr 
50 


Ala 


Ala 


Asp 


val 


Glu 

55 


Ala 


Asp 


Asp 


Phe 


Lys 
60 


Gly 


Leu 


Gly 


Leu 




Lys 
65 


Lys 


val 


val 


Thr 


Asn 
70 


Leu 


Thr 


Lys 


Thr 


Val 

75 


Asn 


Glu 


Asn 


Lys 


Gin 
80 


30 


Asn 


val 


Asp 


Ala 


Lys 
85 


val 


Lys 


Ala 


Ala 


Glu 

90 


Ser 


GlU 


He 


GlU 


Lys 

95 


Leu 




Thr 


Thr 


Lys 


Leu 
100 


Ala 


Asp 


Thr 


ASp 


Ala 
105 


Ala 


Leu 


Ala 


ASp 


Thr 
110 


ASp 


Ala 


35 


Ala 


Leu 


Asp 
115 


Ala 


Thr 


Thr 


Asn 


Ala 
120 


Leu 


Asn 


Lys 


Leu 


Gly 
125 


Glu 


Asn 


He 




Thr 


Thr 
130 


Phe 


Ala 


Glu 


Glu 


Thr 

13 5 


Lys 


Thr 


Asn 


He 


val 
140 


Lys 


He 


Asp 


Glu 


40 


Lys 
145 


Leu 


Glu 


Ala 


val 


Ala 
150 


Asp 


Thr 


Val 


Asp 


Lys 
155 


His 


Ala 


Glu 


Ala 


Phe 
160 




Asn 


Asp 


He 


Ala 


Asp 

165 


Ser 


Leu 


Asp 


Glu 


Thr 

170 


Asn 


Thr 


Lys 


Ala 


Asp 

175 


Glu 


45 


Ala 


val 


Lys 


Thr 
180 


Ala 


Asn 


GlU 


Ala 


Lys 
185 


Gin 


Thr 


Ala 


GlU 


Glu 
190 


Thr 


Lys 


Gin 


Asn 


val 
195 


ASP 


Ala 


Lys 


val 


Lys 
200 


Ala 


Ala 


GlU 


Thr 


Ala 
205 


Ala 


Gly 


Lys 


50 


Ala 


Glu 

210 


Ala 


Ala 


Ala 


Gly 


Thr 
215 


Ala 


Asn 


Thr 


Ala 


Ala 
220 


Asp 


Lys 


Ala 


GlU 


Ala 
225 


val 


Ala 


Ala 


Lys 


val 
230 


Thr 


Asp 


He 


Lys 


Ala 
235 


Asp 


He 


Ala 


Thr 


Asn 
240 




Lys 


Asp 


Asn 


lie 


Ala 

245 


Lys 


Lys 


Ala 


Asn 


Ser 
250 


Ala 


Asp 


Val 


Tyr 


Thr 
255 


Arg 


55 


Glu 


Glu 


ser 


Asp 

260 


Ser 


Lys 


Phe 


val 


Arg 
265 


He 


Asp 


Gly 


Leu 


Asn 
270 


Ala 


Thr 
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Thr Glu 

Asp His 
290 

Arg Lys 
305 

Leu Phe 

Leu Ala 

Glu Pro 

Glu Arg 
370 

Gly Asn 
385 

lie val 

Asn His 

Gly Phe 

Ala Asp 
450 

Gly Ala 
465 

lie Arg 

Asp Arg 

Gly Phe 

Asn Ala 
530 

lie Gly 
545 

lie ser 
Ser Thr 
Gin Leu 



Asn Pro 
610 

Ala Ala 

625 



Lys Leu 

275 

Asp Thr 

Glu Thr 

Gin Pro 

Asn Asp 
340 

Asp Gly 
355 

Ser Gly 

Leu Met 

Arg Phe 

Ala Ser 
420 

Ser Leu 
435 

Gly Tyr 

Arg Asp 

Leu Asn 

Phe His 
500 

Lys Arg 
515 

Ala Glu 

Ala Ala 

Glu Gly 

Glu Asn 
580 

Lys Asp 
595 

Asn Ala 
lie Pro 



Asp Thr 

Arg Leu 

Arg Gin 
310 

Tyr Asn 
325 

Ser Phe 

Lys Tyr 

His lie 

lie Gin 
390 

Ser Asp 
405 

His Ser 

Tyr Arg 

Asp Gly 

lie Tyr 
470 

Leu Thr 
485 

Asn Ala 

Ala Thr 

Ala Phe 

Gly Glu 
550 

Ser Asn 
565 

Lys Met 
Tyr Ala 
Ala Gin 



He Lys 
630 



Arg Leu 
280 

Asn Gly 
295 

Gly Leu 
val Gly 
lie Arg 



Ala Ser Ala Glu 
Leu Asp 
Ala Glu 



Lys Thr 
300 



Gly Ser 
330 

Gin val 
345 



Phe Gly 
Gly Lys 
Gin Ala Ala lie 



His Leu 
360 

Gly Leu 
375 



His Gly His Glu 
410 

Asp Ser Asp Glu 
425 

lie His Trp Asp 
440 

Pro Gin Gly Gly 
455 

Ser Tyr Asp lie 



Gin Ala 
315 

Gly Gly 
Leu Asp 
Ser Arg 

lie Gin 

380 

Lys Gly 
395 

Val His 
Ala Gly 
Gly Tyr 



Gly Tyr 
460 

Lys Gly 
475 



Lys Ser lie Ala 
285 

Val Ser Asp Leu 



Ala Leu Ser Gly 
320 

Gly Gly ser Asp 
335 

Arg Gin His Phe 
350 

Gly Glu Leu Ala 
365 

Ser His Gin Leu 



Asp Asn 

Gly Ser 

Arg Tyr 
520 

Asn Gly 
535 

lie val 
He Ala 
Ala Arg 



Thr Gly 
Thr Gin 
ser pro Glu Leu 



Arg Ser 
490 

Met Leu 

505 



Thr Ala 
Gly Ala 



val Met 
570 

lie Asn 
585 



Ala Ala 
600 



Ala lie 
Glu Ala 
Gly lie Gly Ala 



Gly lie 
615 



Asp lie 
540 

Gly Asp 
555 

His Gly 

Asp Leu 

Arg Asp 

Val Ser 
620 

Val Arg 
635 



Asn lie 

Ser Pro 

Ser Pro 
430 

Glu His 
445 

pro Ala 

val Ala 

Gin Arg 

Gly val 
510 

Asp Arg 
525 

val Lys 
Ala val 
Leu Gly 



Gly Tyr 
400 

Phe Asp 
415 

val Asp 

His Pro 

pro Lys 

Gin Asn 
480 

Leu Ala 
495 

Gly Asp 
Ser Gly 
Asn lie 



Gin Gly 
560 

Leu Leu 

575 



Ala Asp 
590 

Trp Ala 
605 



Met Ala 
val Gin 
Asn lie Phe Met 



Gly Lys Tyr Gly 
640 
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Leu Gly Gly lie Thr Ala His Pro lie Lys Arg Ser Gin Met Gly Ala 
645 650 655 

lie Ala Leu Pro Lys Gly Lys Ser Ala val Ser Asp Asn Phe Ala Asp 
5 660 665 670 

Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr His ser Arg Asn lie Arg 
675 680 685 

Ser Asn Leu Glu Gin Arg Tyr Gly Lys Glu Asn lie Thr ser Ser Thr 
10 690 695 700 

val Pro Pro Ser Asn Gly Lys Asn val Lys Leu Ala Asp Gin Arg His 
705 710 715 720 

Pro Lys Thr Gly Val Pro Phe Asp Gly Lys Gly Phe Pro Asn Phe Glu 

15 725 730 735 

Lys His val Lys Tyr Asp Thr Leu Glu His His His His His His 

740 745 750 

<210> 154 
20 <211> 1773 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 961C-741 
<400> 154 

atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 60 

aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 120 

gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 180 

ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 240 

aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 300 

gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 360 

ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 420 

aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 480 

aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 540 

gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 600 

gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 660 

35 gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 720 

aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 780 

agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 840 

gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 900 

gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 960 

ctgttccaac cttacaacgt gggtggatcc ggagggggtg gtgtcgccgc cgacatcggt 1020 

40 gcggggcttg ccgatgcact aaccgcaccg ctcgaccata aagacaaagg tttgcagtct 1080 

ttgacgctgg atcagtccgt caggaaaaac gagaaactga agctggcggc acaaggtgcg 1140 

gaaaaaactt atggaaacgg tgacagcctc aatacgggca aattgaagaa cgacaaggtc 1200 

agccgtttcg actttatccg ccaaatcgaa gtggacgggc agctcattac cttggagagt 1260 

ggagagttcc aagtatacaa acaaagccat tccgccttaa ccgcctttca gaccgagcaa 1320 

atacaagatt cggagcattc cgggaagatg gttgcgaaac gccagttcag aatcggcgac 1380 

45 atagcgggcg aacatacatc ttttgacaag cttcccgaag gcggcagggc gacatatcgc 1440 

gggacggcgt tcggttcaga cgatgccggc ggaaaactga cctacaccat agatttcgcc 1500 

gccaagcagg gaaacggcaa aatcgaacat ttgaaatcgc cagaactcaa tgtcgacctg 1560 

gccgccgccg atatcaagcc ggatggaaaa cgccatgccg tcatcagcgg ttccgtcctt 1620 

tacaaccaag ccgagaaagg cagttactcc ctcggtatct ttggcggaaa agcccaggaa 1680 

gttgccggca gcgcggaagt gaaaaccgta aacggcatac gccatatcgg ccttgccgcc 1740 

50 aagcaactcg agcaccacca ccaccaccac tga 1773 

<210> 155 

<211> 590 

<212> PRT 

<213> Artificial Sequence 

55 <220> 

<223> 961C-741 
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<400> 155 

Met Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr val Ala lie 
1 5 10 15 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie Asn Gly Phe Lys Ala Gly 
20 25 30 

Glu Thr lie Tyr Asp lie Asp Glu Asp Gly Thr lie Thr Lys Lys Asp 
35 40 45 

Ala Thr Ala Ala Asp val Glu Ala Asp Asp Phe Lys Gly Leu Gly Leu 
50 55 60 

Lys Lys val Val Thr Asn Leu Thr Lys Thr Val Asn Glu Asn Lys Gin 
65 70 75 80 

Asn val Asp Ala Lys val Lys Ala Ala Glu Ser Glu lie Glu Lys Leu 
85 90 95 

Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala Leu Ala Asp Thr Asp Ala 
100 105 110 

Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn Lys Leu Gly Glu Asn lie 
115 120 125 

Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn He Val Lys He Asp Glu 
130 135 140 

Lys Leu Glu Ala val Ala Asp Thr val Asp Lys His Ala Glu Ala Phe 
145 150 155 160 

Asn Asp lie Ala Asp Ser Leu Asp Glu Thr Asn Thr Lys Ala Asp Glu 
165 170 175 

Ala val Lys Thr Ala Asn Glu Ala Lys Gin Thr Ala Glu Glu Thr Lys 
180 185 190 

Gin Asn val Asp Ala Lys val Lys Ala Ala Glu Thr Ala Ala Gly Lys 
195 200 205 

Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr Ala Ala Asp Lys Ala Glu 
210 215 220 

Ala Val Ala Ala Lys Val Thr Asp lie Lys Ala Asp lie Ala Thr Asn 
225 230 235 240 

Lys Asp Asn lie Ala Lys Lys Ala Asn Ser Ala Asp val Tyr Thr Arg 
245 " 250 255 

Glu Glu Ser Asp Ser Lys Phe val Arg lie Asp Gly Leu Asn Ala Thr 
260 265 270 

Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser Ala Glu Lys Ser lie Ala 
275 280 285 

Asp His Asp Thr Arg Leu Asn Gly Leu Asp Lys Thr Val Ser Asp Leu 
290 295 300 

Arg Lys Glu Thr Arg Gin Gly Leu Ala Glu Gin Ala Ala Leu Ser Gly 
305 310 315 320 

Leu Phe Gin Pro Tyr Asn Val Gly Gly Ser Gly Gly Gly Gly Val Ala 
325 330 335 

Ala Asp lie Gly Ala Gly Leu Ala Asp Ala Leu Thr Ala Pro Leu Asp 
340 345 3 50 

His Lys Asp Lys Gly Leu Gin ser Leu Thr Leu Asp Gin Ser val Arg 
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355 360 365 

Lys Asn Glu Lys Leu Lys Leu Ala Ala Gin Gly Ala Glu Lys Thr Tyr 
370 375 380 

Gly Asn Gly Asp ser Leu Asn Thr Gly Lys Leu Lys Asn Asp Lys val 
385 390 395 400 

Ser Arg Phe Asp Phe lie Arg Gin lie Glu val Asp Gly Gin Leu lie 
405 410 415 

Thr Leu Glu Ser Gly Glu Phe Gin val Tyr Lys Gin ser His ser Ala 
420 425 430 

Leu Thr Ala Phe Gin Thr Glu Gin lie Gin Asp Ser Glu His Ser Gly 
435 440 445 

Lys Met val Ala Lys Arg Gin Phe Arg lie Gly Asp lie Ala Gly Glu 
450 455 460 

His Thr Ser Phe Asp Lys Leu Pro Glu Gly Gly Arg Ala Thr Tyr Arg 
465 470 475 480 

Gly Thr Ala Phe Gly Ser Asp Asp Ala Gly Gly Lys Leu Thr Tyr Thr 
485 490 495 

lie Asp Phe Ala Ala Lys Gin Gly Asn Gly Lys He Glu His Leu Lys 
500 505 510 

Ser Pro Glu Leu Asn Val Asp Leu Ala Ala Ala Asp lie Lys Pro Asp 
515 520 525 

Gly Lys Arg His Ala val lie ser Gly ser val Leu Tyr Asn Gin Ala 
530 53 5 540 

Glu Lys Gly ser Tyr ser Leu Gly lie Phe Gly Gly Lys Ala Gin Glu 
545 550 555 560 

val Ala Gly Ser Ala Glu val Lys Thr Val Asn Gly lie Arg His lie 
565 570 575 

Gly Leu Ala Ala Lys Gin Leu Glu His His His His His His 
580 585 590 

<210> 156 
<211> 4170 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 961C-983 
<400> 156 

atggccacaa acgacgacga tgttaaaaaa gctgccactg tggccattgc tgctgcctac 
aacaatggcc aagaaatcaa cggtttcaaa gctggagaga ccatctacga cattgatgaa 
gacggcacaa ttaccaaaaa agacgcaact gcagccgatg ttgaagccga cgactttaaa 
ggtctgggtc tgaaaaaagt cgtgactaac ctgaccaaaa ccgtcaatga aaacaaacaa 
aacgtcgatg ccaaagtaaa agctgcagaa tctgaaatag aaaagttaac aaccaagtta 
gcagacactg atgccgcttt agcagatact gatgccgctc tggatgcaac caccaacgcc 
ttgaataaat tgggagaaaa tataacgaca tttgctgaag agactaagac aaatatcgta 
aaaattgatg aaaaattaga agccgtggct gataccgtcg acaagcatgc cgaagcattc 
aacgatatcg ccgattcatt ggatgaaacc aacactaagg cagacgaagc cgtcaaaacc 
gccaatgaag ccaaacagac ggccgaagaa accaaacaaa acgtcgatgc caaagtaaaa 
gctgcagaaa ctgcagcagg caaagccgaa gctgccgctg gcacagctaa tactgcagcc 
gacaaggccg aagctgtcgc tgcaaaagtt accgacatca aagctgatat cgctacgaac 
aaagataata ttgctaaaaa agcaaacagt gccgacgtgt acaccagaga agagtctgac 
agcaaatttg tcagaattga tggtctgaac gctactaccg aaaaattgga cacacgcttg 
gcttctgctg aaaaatccat tgccgatcac gatactcgcc tgaacggttt ggataaaaca 
gtgtcagacc tgcgcaaaga aacccgccaa ggccttgcag aacaagccgc gctctccggt 
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10 



15 



20 



25 



30 



35 



40 



45 



ctgttccaac 
aatgcaggcg 
gtatcttacg 
cgggatgacg 
cataccggag 
gcaattgaag 
tccgtcggca 
aattacaaaa 
gacattgaag 
atccgccacg 
tccgtggacg 
acgaatgatg 
ctgggcgaac 
actgccgacc 
tattccggcg 
aacctgtcct 
gcacaagctc 
ggcattatca 
ggagaaccgg 
tggtgcctgt 
attgccggaa 
aaatacccgt 
atcggtgcag 
atgaacggac 
gatattgcct 
ggcagccaac 
ggttcgctgg 
ctgatttata 
gcagataccg 
gacggcaaag 
attatcggcg 
accggacgac 
acaaacatcg 
ggcagtgaag 
tcggcagcgg 
aatctggaaa 
gttgaaactg 
ttccgcgcag 
agtctcgccg 
cgcctgaaag 
caaacccaac 
agtacccaaa 
ctgggcatgg 
agtctgtttg 
tcctacggac 
ggcagcgtca 
gccgcaacgg 
gcattcgccg 
ctggtcggac 
gcaacggcgg 
accggcgcga 
gttgccggcc 
agctacgccg 
ttcctcgagc 



cttacaacgt 
gtaccggtat 
ccggtatcaa 
ttgcggttac 
actttccaaa 
caggctatac 
gcatatcctt 
actatacggc 
cttctttcga 
taaaagaaat 
gcagacctgc 
aaaccaagaa 
gtggcgtgcg 
ttttccaaat 
gtgataaaac 
accacatccg 
agcccaacac 
cagtcgcagg 
gtacagaacc 
cggcacccta 
catccttttc 
ggatgagcaa 
tcggcgtgga 
ccgcgtcctt 
actccttccg 
tgcaactgca 
tgttgtacgg 
acggggcggc 
accaatccgg 
gtacgctgta 
gcaagctgta 
gtgttccctt 
aaaccgacgg 
gcgacacgct 
cacattccgc 
acctgatggt 
cggcagccga 
cggcagccgt 
ctaccgtcta 
ccgtatcgga 
aggacggtgg 
ccgtcggcat 
gacgcagcac 
caggcatacg 
gctacaaaaa 
acggcacgct 
gagatttgac 
aaaaaggcag 
tcgcgggtct 
gcgtggaacg 
ctgcagcaac 
tgggcgcgga 
gttccaaaca 
accaccacca 



gggtggatcc 
cggcagcaac 
gaacgaaatg 
agacagggat 
cccaaatgac 
aggacgcggg 
tcccgaactg 
gtatatgcgg 
cgatgaggcc 
cggacacatc 
aggcggtatt 
cgaaatgatg 
catcgtcaat 
agccaattcg 
agacgagggt 
taataaaaac 
atatgcccta 
cgtagaccgc 
gcttgagtat 
tgaagcaagc 
cgcacccatc 
cgacaacctg 
cagcaagttc 
tccgttcggc 
taacgacatt 
cggcaacaac 
caacaacaaa 
atccggcggc 
cgcaaacgaa 
cacacgtttg 
catgtcggca 
cctgagtgcc 
cggcctgctg 
gtcctattat 
gcccgccggt 
cgaactggat 
ccgcacagat 
acagcatgcg 
tgccgacagt 
cgggttggac 
aacgtgggaa 
tgccgcgaaa 
atggagcgaa 
gcacgatgcg 
cagcatcagc 
gatgcagctg 
ggtcgaaggc 
tgctttgggc 
gaagctgtcg 
cgacctgaac 
cggcaagacg 
tgtcgaattc 
gtacggcaac 
ccaccactga 



ggcggaggcg 
agcagagcaa 
tgcaaagaca 
gccaaaatca 
gcatacaaga 
gtagaggtag 
tatggcagaa 
aaggaagcgc 
gttatagaga 
gatttggtct 
gcgcccgatg 
gttgcagcca 
aacagttttg 
gaggagcagt 
atccgcctga 
atgcttttca 
ttgccatttt 
agtggagaaa 
ggctccaacc 
gtccgtttca 
gtaaccggca 
cgtaccacgt 
ggctggggac 
gactttaccg 
tcaggcacgg 
acctatacgg 
tcggatatgc 
agcctgaaca 
accgtacaca 
ggcaaactgc 
cgcggcaagg 
gccaaaatcg 
gcttccctcg 
gtccgtcgcg 
ctgaaacacg 
gcctccgaat 
atgccgggca 
aatgccgccg 
accgccgccc 
cacaacggca 
cagggcggtg 
accggcgaaa 
aacagtgcaa 
ggcgatatcg 
cgcagcaccg 
ggcgcactgg 
ggtctgcgct 
tggagcggca 
caacccttga 
ggacgcgact 
ggggcacgca 
ggcaacggct 
cacagcggac 



gcacttctgc 
caacagcgaa 
gaagcatgct 
atgccccccc 
atttgatcaa 
gtatcgtcga 
aagaacacgg 
ctgaagacgg 
ctgaagcaaa 
cccatattat 
cgacgctaca 
tccgcaatgc 
gaacaacatc 
accgccaagc 
tgcaacagag 
tcttttcgac 
atgaaaaaga 
agttcaaacg 
attgcggaat 
cccgtacaaa 
cggcggctct 
tgctgacgac 
tgctggatgc 
ccgatacgaa 
gcggcctgat 
gcaaaaccat 
gcgtcgaaac 
gcgacggcat 
tcaaaggcag 
tgaaagtgga 
gggcaggcta 
ggcaggatta 
acagcgtcga 
gcaatgcggc 
ccgtagaaca 
catccgcaac 
tccgccccta 
acggtgtacg 
atgccgatat 
cgggtctgcg 
ttgaaggcaa 
atacgacagc 
atgcaaaaac 
gctatctcaa 
gtgcggacga 
gcggtgtcaa 

acgacctgct 
acagcctcac 
gcgataaagc 
acacggtaac 
atatgccgca 
ggaacggctt 
gagtcggcgt 



gcccgacttc 
atcagcagca 
ctgtgccggt 
cccgaatctg 
cctcaaacct 
cacaggcgaa 
ctataacgaa 
aggcggtaaa 
gccgacggat 
tggcgggcgt 
cataatgaat 
atgggtcaag 
gagggcaggc 
gttgctcgac 
cgattacggc 
aggcaatgac 
cgctcaaaaa 
ggaaatgtat 
tactgccatg 
cccgattcaa 
gctgctgcag 
ggctcaggac 
gggtaaggcc 
aggtacatcc 
caaaaaaggc 
tatcgaaggc 
caaaggtgcg 
tgtctatctg 
tctgcagctg 
cggtacggcg 
tctcaacagt 
ttctttcttc 
aaaaacagcg 
acggactgct 
gggcggcagc 
acccgagacg 
cggcgcaact 
catcttcaac 
gcagggacgc 
cgtcatcgcg 
aatgcgcggc 
agccgccaca 
cgacagcatt 
aggcctgttc 
acatgcggaa 
cgttccgttt 
caaacaggat 
tgaaggcacg 
cgtcctgttt 
gggcggcttt 
cacccgtctg 
ggcacgttac 
aggctaccgg 



<210> 157 

<211> 1389 

<212> PRT 

<213> Artificial sequence 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4170 



50 



<220> 
<223> 



961C-983 



55 



<400> 157 

Met Ala Thr Asn Asp Asp Asp val Lys Lys Ala Ala Thr Val Ala lie 
1 5 10 15 

Ala Ala Ala Tyr Asn Asn Gly Gin Glu He Asn Gly Phe Lys Ala Gly 
20 25 30 



221 



EP 1 790 660 A2 



Glu Thr lie 

35 

Ala Thr Ala 
50 

Lys Lys val 
65 

Asn val Asp 

Thr Thr Lys 

Ala Leu Asp 
115 

Thr Thr Phe 
130 

Lys Leu Glu 
145 

Asn Asp lie 

Ala val Lys 

Gin Asn val 
195 

Ala Glu Ala 
210 

Ala val Ala 
225 

Lys Asp Asn 

Glu Glu Ser 

Thr Glu Lys 
275 

Asp HIS Asp 

290 

Arg Lys Glu 
305 

Leu Phe Gin 

Ala Pro Asp 

Ala Thr Thr 
355 

Glu Met Cys 
370 

Ala val Thr 
385 



Tyr Asp He Asp 
Ala Asp 
Val Thr 



val Glu 
55 



Ala Lys 
85 

Leu Ala 
100 

Ala Thr 

Ala Glu 

Ala val 

Ala Asp 
165 

Thr Ala 
180 

Asp Ala 

Ala Ala 

Ala Lys 

He Ala 
245 

Asp Ser 
260 

Leu Asp 
Thr Arg 
Thr Arg 

Pro Tyr 

325 

Phe Asn 
340 

Ala Lys 
Lys Asp 
Asp Arg 



Asn Leu 
70 

Val Lys 

Asp Thr 

Thr Asn 

Glu Thr 
135 

Ala Asp 
150 

Ser Leu 

Asn Glu 

Lys Val 

Gly Thr 
215 

val Thr 
230 

Lys Lys 

Lys Phe 

Thr Arg 

Leu Asn 
295 

Gin Gly 
310 

Asn val 
Ala Gly 
Ser Ala 



Arg ser 
375 

Asp Ala 
390 



Glu Asp 
40 

Ala Asp 

Thr Lys 

Ala Ala 

Asp Ala 
105 

Ala Leu 

120 

Lys Thr 
Thr Val 
Asp Glu 



Gly Thr 
Asp Phe 

Thr val 

75 

Glu Ser 
90 

Ala Leu 
Asn Lys 
Asn lie 



Asp Lys 
155 

Thr Asn 
170 



Ala Lys 
185 

Lys Ala 
200 



Gin Thr 
Ala Glu 
Ala Asn Thr Ala 



Asp lie 

Ala Asn 

val Arg 
265 

Leu Ala 
280 

Gly Leu 

Leu Ala 

Gly Gly 

Gly Thr 
345 

Ala val 
360 

Met Leu 
Lys lie 



Lys Ala 
235 

Ser Ala 
250 

lie Asp 
ser Ala 
Asp Lys 

Glu Gin 

315 

ser Gly 
330 

Gly lie 
Ser Tyr 
Cys Ala 



Asn Ala 
395 



He Thr Lys 
45 

Lys Gly Leu 
60 

Asn Glu Asn 

Glu lie Glu 

Ala Asp Thr 
110 

Leu Gly Glu 
125 

Val Lys lie 
140 

His Ala Glu 

Thr Lys Ala 

Ala Glu Glu 
190 

Thr Ala Ala 

205 

Ala Asp Lys 
220 

Asp lie Ala 

Asp val Tyr 

Gly Leu Asn 
270 

Glu Lys ser 
285 

Thr Val Ser 
300 

Ala Ala Leu 
Gly Gly Gly 



Gly Ser Asn 
350 

Ala Gly lie 
365 

Gly Arg Asp 
380 

Pro Pro Pro 



Lys Asp 

Gly Leu 

Lys Gin 
80 

Lys Leu 

95 

Asp Ala 

Asn lie 

Asp Glu 

Ala Phe 
160 

Asp Glu 
175 

Thr Lys 

Gly Lys 

Ala Glu 

Thr Asn 
240 

Thr Arg 
255 

Ala Thr 

lie Ala 

Asp Leu 

Ser Gly 
320 

Thr ser 
335 

Ser Arg 
Lys Asn 
Asp val 



Asn Leu 
400 
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His Thr Gly Asp Phe Pro Asn Pro Asn Asp Ala Tyr Lys Asn Leu lie 
405 410 415 

Asn Leu Lys Pro Ala lie Glu Ala Gly Tyr Thr Gly Arg Gly val Glu 
420 425 430 

Val Gly lie val Asp Thr Gly Glu Ser val Gly Ser He Ser Phe Pro 
435 440 445 

Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr Asn Glu Asn Tyr Lys Asn 
450 455 460 

Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro Glu Asp Gly Gly Gly Lys 
465 ' 470 475 480 

Asp He Glu Ala Ser Phe Asp Asp Glu Ala Val He Glu Thr Glu Ala 
485 490 495 

Lys Pro Thr Asp lie Arg His val Lys Glu lie Gly His lie Asp Leu 
500 505 510 

Val Ser His lie lie Gly Gly Arg Ser val Asp Gly Arg Pro Ala Gly 
515 520 525 

Gly lie Ala Pro Asp Ala Thr Leu His lie Met Asn Thr Asn Asp Glu 
530 535 540 

Thr Lys Asn Glu Met Met val Ala Ala lie Arg Asn Ala Trp val Lys 
545 550 555 560 

Leu Gly Glu Arg Gly val Arg lie val Asn Asn ser Phe Gly Thr Thr 
565 570 575 

Ser Arg Ala Gly Thr Ala Asp Leu Phe Gin lie Ala Asn ser Glu Glu 
580 585 590 

Gin Tyr Arg Gin Ala Leu Leu Asp Tyr Ser Gly Gly Asp Lys Thr Asp 
595 600 605 

Glu Gly lie Arg Leu Met Gin Gin Ser Asp Tyr Gly Asn Leu Ser Tyr 
610 615 620 

His lie Arg Asn Lys Asn Met Leu Phe lie Phe Ser Thr Gly Asn Asp 
625 630 635 640 

Ala Gin Ala Gin pro Asn Thr Tyr Ala Leu Leu Pro Phe Tyr Glu Lys 
645 650 655 

Asp Ala Gin Lys Gly lie lie Thr Val Ala Gly Val Asp Arg Ser Gly 
660 665 670 

Glu Lys Phe Lys Arg Glu Met Tyr Gly Glu Pro Gly Thr Glu Pro Leu 
675 680 685 

Glu Tyr Gly Ser Asn His Cys Gly lie Thr Ala Met Trp Cys Leu Ser 
690 695 700 

Ala Pro Tyr Glu Ala Ser val Arg Phe Thr Arg Thr Asn Pro lie Gin 
705 710 715 720 

He Ala Gly Thr Ser Phe Ser Ala Pro He val Thr Gly Thr Ala Ala 
725 730 735 

Leu Leu Leu Gin Lys Tyr Pro Trp Met Ser Asn Asp Asn Leu Arg Thr 
740 745 750 

Thr Leu Leu Thr Thr Ala Gin Asp lie Gly Ala val Gly val Asp Ser 
755 760 765 
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Lys Phe Gly Trp Gly Leu Leu Asp Ala Gly Lys Ala Met Asn Gly Pro 
770 775 780 

Ala ser Phe Pro Phe Gly Asp Phe Thr Ala Asp Thr Lys Gly Thr ser 
785 790 795 800 

Asp lie Ala Tyr ser Phe Arg Asn Asp lie Ser Gly Thr Gly Gly Leu 
805 810 815 

lie Lys Lys Gly Gly Ser Gin Leu Gin Leu His Gly Asn Asn Thr Tyr 
820 825 830 

Thr Gly Lys Thr lie lie Glu Gly Gly Ser Leu val Leu Tyr Gly Asn 
835 840 845 

Asn Lys ser Asp Met Arg val Glu Thr Lys Gly Ala Leu lie Tyr Asn 
850 855 860 

Gly Ala Ala Ser Gly Gly Ser Leu Asn Ser Asp Gly lie val Tyr Leu 
865 870 875 880 

Ala Asp Thr Asp Gin Ser Gly Ala Asn Glu Thr val His lie Lys Gly 
885 890 895 

Ser Leu Gin Leu Asp Gly Lys Gly Thr Leu Tyr Thr Arg Leu Gly Lys 
900 905 910 

Leu Leu Lys val Asp Gly Thr Ala lie lie Gly Gly Lys Leu Tyr Met 
915 920 925 

Ser Ala Arg Gly Lys Gly Ala Gly Tyr Leu Asn ser Thr Gly Arg Arg 
930 935 940 

val Pro Phe Leu ser Ala Ala Lys lie Gly Gin Asp Tyr ser Phe Phe 
945 950 955 960 

Thr Asn lie Glu Thr Asp Gly Gly Leu Leu Ala ser Leu Asp ser val 
965 970 975 

Glu Lys Thr Ala Gly ser Glu Gly Asp Thr Leu ser Tyr Tyr val Arg 
980 985 990 

Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala Ala Ala His ser Ala Pro 
995 1000 1005 

Ala Gly Leu Lys His Ala val Glu Gin Gly Gly ser Asn Leu Glu Asn 
1010 1015 1020 

Leu Met Val Glu Leu Asp Ala Ser Glu Ser ser Ala Thr Pro Glu Thr 
1025 1030 1035 1040 

Val Glu Thr Ala Ala Ala Asp Arg Thr Asp Met Pro Gly lie Arg Pro 
1045 1050 105 5 

Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala Val Gin His Ala Asn Ala 
1060 1065 1070 

Ala Asp Gly Val Arg lie Phe Asn Ser Leu Ala Ala Thr val Tyr Ala 
1075 1080 1085 

Asp Ser Thr Ala Ala His Ala Asp Met Gin Gly Arg Arg Leu Lys Ala 
1090 1095 1100 

Val Ser Asp Gly Leu Asp His Asn Gly Thr Gly Leu Arg val lie Ala 
1105 1110 1115 1120 

Gin Thr Gin Gin Asp Gly Gly Thr Trp Glu Gin Gly Gly val Glu Gly 
1125 1130 1135 
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Lys Met Arg Gly Ser Thr Gin Thr val Gly lie Ala Ala Lys Thr Gly 
1140 1145 1150 

Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly Met Gly Arg Ser Thr Trp 
5 1155 1160 1165 

Ser Glu Asn ser Ala Asn Ala Lys Thr Asp Ser lie Ser Leu Phe Ala 
1170 1175 1180 

Gly lie Arg His Asp Ala Gly Asp lie Gly Tyr Leu Lys Gly Leu Phe 
10 1185 1190 1195 1200 

ser Tyr Gly Arg Tyr Lys Asn ser lie ser Arg ser Thr Gly Ala Asp 
1205 1210 1215 

Glu His Ala Glu Gly Ser val Asn Gly Thr Leu Met Gin Leu Gly Ala 
15 1220 1225 1230 

Leu Gly Gly val Asn val Pro Phe Ala Ala Thr Gly Asp Leu Thr Val 
1235 1240 1245 

Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys Gin Asp Ala Phe Ala Glu 
20 1250 1255 1260 

Lys Gly Ser Ala Leu Gly Trp ser Gly Asn ser Leu Thr Glu Gly Thr 
1265 1270 1275 1280 

Leu val Gly Leu Ala Gly Leu Lys Leu Ser Gin Pro Leu Ser Asp Lys 
25 1285 1290 1295 

Ala val Leu Phe Ala Thr Ala Gly val Glu Arg Asp Leu Asn Gly Arg 
1300 1305 1310 

Asp Tyr Thr Val Thr Gly Gly Phe Thr Gly Ala Thr Ala Ala Thr Gly 
1315 1320 1325 

30 

Lys Thr Gly Ala Arg Asn Met Pro His Thr Arg Leu val Ala Gly Leu 
1330 1335 1340 

Gly Ala Asp val Glu Phe Gly Asn Gly Trp Asn Gly Leu Ala Arg Tyr 
1345 1350 1355 1360 

35 

Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn His Ser Gly Arg Val Gly 
1365 1370 1375 

Val Gly Tyr Arg Phe Leu Glu His His His His His His 
1380 1385 

40 

<210> 158 
<211> 2304 
<212> DNA 

<213> Artificial Sequence 

45 <220> 

<223> 961CL-ORF46.1 

<400> 158 

atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 60 

gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 120 

50 gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 180 

gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 240 

tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 300 

aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 360 

aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 420 

aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 480 

atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 540 

55 gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 600 

aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 660 
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Lys His Ala 

Asn Thr Lys 

195 

Thr Ala Glu 
210 

Glu Thr Ala 

225 

Ala Ala Asp 

Ala Asp lie 

Ala Asp val 
275 

Asp Gly Leu 
290 

Ala Glu Lys 
305 

Lys Thr Val 

Gin Ala Ala 

Gly Gly Gly 
355 

Leu Asp Arg 
370 

ser Arg Gly 
385 

lie Gin Ser 

Lys Gly Asn 

val His ser 
435 

Ala Gly ser 
450 

Gly Tyr Glu 
465 

Gly Tyr Pro 
Lys Gly Val 



165 

Glu Ala 
180 

Ala Asp 

Glu Thr 

Ala Gly 

Lys Ala 
245 

Ala Thr 
260 

Tyr Thr 

Asn Ala 

Ser lie 

Ser Asp 
325 

Leu Ser 
340 

Gly ser 

Gin His 

Glu Leu 

His Gin 
405 

lie Gly 
420 

Pro Phe 
Pro val 
His His 



Ala Pro 
485 

Ala Gin 
500 



Phe Asn Asp 

Glu Ala val 
200 

Lys Gin Asn 
215 

Lys Ala Glu 
230 

Glu Ala val 

Asn Lys Asp 

Arg Glu Glu 
280 

Thr Thr Glu 
295 

Ala Asp His 
310 

Leu Arg Lys 

Gly Leu Phe 

Asp Leu Ala 
360 

Phe Glu Pro 

375 

Ala Glu Arg 
390 

Leu Gly Asn 

Tyr lie val 

Asp Asn His 
440 

Asp Gly Phe 
455 

Pro Ala Asp 
470 

Lys Gly Ala 
Asn lie Arg 



170 

lie Ala 
185 

Lys Thr 

Val Asp 

Ala Ala 

Ala Ala 
250 

Asn lie 
265 

ser Asp 
Lys Leu 
Asp Thr 

Glu Thr 

330 

Gin Pro 

345 

Asn Asp 

Asp Gly 

Ser Gly 

Leu Met 
410 

Arg Phe 
425 

Ala Ser 
Ser Leu 
Gly Tyr 



Arg Asp 
490 

Leu Asn 
505 



Asp Ser Leu 

Ala Asn Glu 
205 

Ala Lys Val 
220 

Ala Gly Thr 

235 

Lys Val Thr 

Ala Lys Lys 

Ser Lys Phe 
285 

Asp Thr Arg 
300 

Arg Leu Asn 
315 

Arg Gin Gly 

Tyr Asn val 

ser Phe lie 
365 

Lys Tyr His 
380 

His lie Gly 
395 

lie Gin Gin 

Ser Asp His 

His Ser Asp 
445 

Tyr Arg lie 
460 

Asp Gly Pro 
475 

He Tyr ser 
Leu Thr Asp 



Thr Gly Gin 
515 



Arg Leu 
Thr Gin Gly val Gly 



Ala Asp Arg 
520 



Asn Ala Gly 
525 



Phe His 

Asp Gly Phe Lys Arg Ala Thr Arg 



175 

Asp Glu Thr 
190 

Ala Lys Gin 

Lys Ala Ala 

Ala Asn Thr 
240 

Asp lie Lys 
255 

Ala Asn ser 
270 

val Arg lie 

Leu Ala Ser 

Gly Leu Asp 
320 

Leu Ala Glu 
335 

Gly Gly ser 
350 

Arg Gin val 

Leu Phe Gly 

Leu Gly Lys 
400 

Ala Ala lie 
415 

Gly His Glu 
430 

Ser Asp Glu 

His Trp Asp 

Gin Gly Gly 
480 

Tyr Asp lie 
495 

Asn Arg Ser 
510 

Ser Met Leu 
Tyr ser pro 
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530 535 540 

Glu Leu Asp Arg Ser Gly Asn Ala Ala Glu Ala Phe Asn Gly Thr Ala 
545 550 555 560 

Asp lie Val Lys Asn He lie Gly Ala Ala Gly Glu He Val Gly Ala 
565 570 575 

Gly Asp Ala val Gin Gly lie Ser Glu Gly Ser Asn lie Ala val Met 
580 585 590 

His Gly Leu Gly Leu Leu Ser Thr Glu Asn Lys Met Ala Arg lie Asn 
595 600 605 

Asp Leu Ala Asp Met Ala Gin Leu Lys Asp Tyr Ala Ala Ala Ala lie 
610 615 620 

Arg Asp Trp Ala Val Gin Asn Pro Asn Ala Ala Gin Gly lie Glu Ala 
625 630 635 640 

val ser Asn lie Phe Met Ala Ala lie Pro lie Lys Gly lie Gly Ala 
645 650 655 

val Arg Gly Lys Tyr Gly Leu Gly Gly lie Thr Ala His Pro lie Lys 
660 665 670 

Arg Ser Gin Met Gly Ala lie Ala Leu Pro Lys Gly Lys Ser Ala Val 
675 680 685 

Ser Asp Asn Phe Ala Asp Ala Ala Tyr Ala Lys Tyr Pro Ser Pro Tyr 
690 695 700 

His Ser Arg Asn lie Arg Ser Asn Leu Glu Gin Arg Tyr Gly Lys Glu 
705 710 715 720 

Asn lie Thr Ser Ser Thr Val Pro Pro Ser Asn Gly Lys Asn Val Lys 
725 730 735 

Leu Ala Asp Gin Arg His Pro Lys Thr Gly Val Pro Phe Asp Gly Lys 
740 745 750 

Gly Phe Pro Asn Phe Glu Lys His val Lys Tyr Asp Thr 
755 760 765 

<210> 160 
<211> 1839 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 961CL-741 
<400> 160 

atgaaacact ttccatccaa agtactgacc acagccatcc ttgccacttt ctgtagcggc 
gcactggcag ccacaaacga cgacgatgtt aaaaaagctg ccactgtggc cattgctgct 
gcctacaaca atggccaaga aatcaacggt ttcaaagctg gagagaccat ctacgacatt 
gatgaagacg gcacaattac caaaaaagac gcaactgcag ccgatgttga agccgacgac 
tttaaaggtc tgggtctgaa aaaagtcgtg actaacctga ccaaaaccgt caatgaaaac 
aaacaaaacg tcgatgccaa agtaaaagct gcagaatctg aaatagaaaa gttaacaacc 
aagttagcag acactgatgc cgctttagca gatactgatg ccgctctgga tgcaaccacc 
aacgccttga ataaattggg agaaaatata acgacatttg ctgaagagac taagacaaat 
atcgtaaaaa ttgatgaaaa attagaagcc gtggctgata ccgtcgacaa gcatgccgaa 
gcattcaacg atatcgccga ttcattggat gaaaccaaca ctaaggcaga cgaagccgtc 
aaaaccgcca atgaagccaa acagacggcc gaagaaacca aacaaaacgt cgatgccaaa 
gtaaaagctg cagaaactgc agcaggcaaa gccgaagctg ccgctggcac agctaatact 
gcagccgaca aggccgaagc tgtcgctgca aaagttaccg acatcaaagc tgatatcgct 
acgaacaaag ataatattgc taaaaaagca aacagtgccg acgtgtacac cagagaagag 
tctgacagca aatttgtcag aattgatggt ctgaacgcta ctaccgaaaa attggacaca 
cgcttggctt ctgctgaaaa atccattgcc gatcacgata ctcgcctgaa cggtttggat 
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aaaacagtgt cagacctgcg caaagaaacc 

tccggtctgt tccaacctta caacgtgggt 

atcggtgcgg ggcttgccga tgcactaacc 
5 cagtctttga cgctggatca gtccgtcagg 

QQtgcggaaa aaacttatgg aaacggtgac 

aaggtcagcc gtttcgactt tatccgccaa 

gagagtggag agttccaagt atacaaacaa 

gagcaaatac aagattcgga gcattccggg 

ggcgacatag cgggcgaaca tacatctttt 

tatcgcggga cggcgttcgg ttcagacgat 
10 ttcgccgcca agcagggaaa cggcaaaatc 

gacctggccg ccgccgatat caagccggat 

gtcctttaca accaagccga gaaaggcagt 

caggaagttg ccggcagcgc ggaagtgaaa 

gccgccaagc aactcgagca ccaccaccac 

15 <210> 161 

<211> 612 

<212> PRT 

<213> Artificial sequence 
<220> 

20 <223> 961CL-741 

<400> 161 

Met Lys His Phe Pro ser i_ys val Leu Thr Thr Ala lie Leu Ala Thr 
1 5 10 15 

phe cys ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp val Lys Lys 

25 2 0 2 5 30 

Ala Ala Thr val Ala lie Ala Ala Ala Tyr Asn Asn Gly Gin Glu lie 
35 40 45 

Asn Gly Phe Lys Ala Gly Glu Thr lie Tyr Asp lie Asp Glu Asp Gly 

30 50 55 60 

Thr lie Thr Lys Lys Asp Ala Thr Ala Ala Asp val Glu Ala Asp Asp 
65 70 75 80 

Phe Lys Gly Leu Gly Leu Lys Lys val Val Thr Asn Leu Thr Lys Thr 

35 85 90 95 

val Asn Glu Asn Lys Gin Asn Val Asp Ala Lys val Lys Ala Ala Glu 
100 105 110 

Ser Glu lie Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala 
115 120 125 

40 

Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn 
130 135 140 

Lys Leu Gly Glu Asn lie Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn 
145 150 155 160 

45 

He val Lys lie Asp Glu Lys Leu Glu Ala val Ala Asp Thr val Asp 
165 170 175 

Lys His Ala Glu Ala Phe Asn Asp lie Ala Asp Ser Leu Asp Glu Thr 
180 185 190 

50 

Asn Thr Lys Ala Asp Glu Ala Val Lys Thr Ala Asn Glu Ala Lys Gin 
195 200 205 

Thr Ala Glu Glu Thr Lys Gin Asn Val Asp Ala Lys Val Lys Ala Ala 
210 215 220 

55 Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr 

225 230 235 240 



cgccaaggcc ttgcagaaca 
ggatccggag ggggtggtgt 
gcaccgctcg accataaaga 
aaaaacgaga aactgaagct 
agcctcaata cgggcaaatt 
atcgaagtgg acgggcagct 
agccattccg ccttaaccgc 
aagatggttg cgaaacgcca 
gacaagcttc ccgaaggcgg 
gccggcggaa aactgaccta 
gaacatttga aatcgccaga 
ggaaaacgcc atgccgtcat 
tactccctcg gtatctttgg 
accgtaaacg gcatacgcca 
caccactga 



agccgcgctc 1020 

cgccgccgac 1080 

caaaggtttg 1140 

ggcggcacaa 1200 

gaagaacgac 1260 

cattaccttg 1320 

ctttcagacc 1380 

gttcagaatc 1440 

cagggcgaca 1500 

caccatagat 1560 

actcaatgtc 1620 

cagcggttcc 1680 

cggaaaagcc 1740 

tatcggcctt 1800 
1839 
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Ala Ala 

Ala Asp 

Ala Asp 

Asp Gly 
290 

Ala Glu 
305 

Lys Thr 
Gin Ala 
Gly Gly 

Leu Thr 

370 

Leu Asp 
385 

Gly Ala 

Leu Lys 

val Asp 

Lys Gin 
450 

Asp Ser 
465 

Gly Asp 

Gly Arg 

Gly Lys 

Lys lie 
530 

Ala Asp 
545 

val Leu 
Gly Gly 
Asn Gly 



Asp Lys Ala 
245 

lie Ala Thr 
260 

val Tyr Thr 

275 

Leu Asn Ala 

Lys Ser lie 

val ser Asp 
325 

Ala Leu Ser 
340 

Gly Gly val 
355 

Ala Pro Leu 

Gin ser val 

Glu Lys Thr 
405 

Asn Asp Lys 
420 

Gly Gin Leu 
435 

Ser His Ser 

Glu His Ser 

He Ala Gly 
485 

Ala Thr Tyr 
500 

Leu Thr Tyr 
515 

Glu His Leu 
lie Lys Pro 



Glu Ala val Ala 
Asn Lys 
Arg Glu 



Asp Asn 
265 



Tyr Asn Gin 
565 

Lys Ala Gin 
580 

He Arg His 
595 



Thr Thr 

295 

Ala Asp 
310 

Leu Arg 

Gly Leu 

Ala Ala 

Asp His 
375 

Arg Lys 
390 

Tyr Gly 

val ser 

lie Thr 

Ala Leu 
455 

Gly Lys 
470 

Glu His 

Arg Gly 

Thr He 

Lys Ser 
535 

Asp Gly 
550 

Ala Glu 
Glu val 
lie Gly 



Glu Ser 
280 

Glu Lys 

His Asp 

Lys Glu 

Phe Gin 
345 

Asp lie 
360 

Lys Asp 
Asn Glu 
Asn Gly 



Ala Lys 
250 

lie Ala 
Asp Ser 
Leu Asp 

Thr Arg 

315 

Thr Arg 
330 

Pro Tyr 
Gly Ala 
Lys Gly 



Lys Leu 
395 

Asp Ser 
410 



Arg Phe 
425 

Leu Glu 
440 



Asp Phe 
Ser Gly 
Thr Ala Phe Gin 



Met val Ala Lys 
475 

Thr Ser Phe Asp 
490 

Thr Ala Phe Gly 

505 

Asp Phe Ala Ala 
520 

Pro Glu Leu Asn 



Lys Arg His Ala 
555 

Lys Gly Ser Tyr 
570 

Ala Gly Ser Ala 
585 

Leu Ala Ala Lys 
600 



val Thr 

Lys Lys 

Lys Phe 
285 

Thr Arg 
300 

Leu Asn 

Gin Gly 

Asn val 

Gly Leu 
365 

Leu Gin 
380 

Lys Leu 

Leu Asn 

lie Arg 

Glu Phe 
445 

Thr Glu 
460 

Arg Gin 

Lys Leu 

Ser Asp 

Lys Gin 
525 

val Asp 
540 

val lie 
Ser Leu 
Glu Val 



Asp lie Lys 
255 

Ala Asn Ser 
270 

Val Arg lie 

Leu Ala Ser 

Gly Leu Asp 
320 

Leu Ala Glu 

335 

Gly Gly Ser 
350 

Ala Asp Ala 

Ser Leu Thr 

Ala Ala Gin 
400 

Thr Gly Lys 
415 

Gin lie Glu 
430 

Gin val Tyr 

Gin lie Gin 

Phe Arg lie 
480 

Pro Glu Gly 
495 

Asp Ala Gly 
510 

Gly Asn Gly 
Leu Ala Ala 



Gin Leu 
605 



Ser Gly Ser 
560 

Gly lie Phe 
575 

Lys Thr Val 
590 

Glu His His 
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His His His His 
610 

<210> 162 

<211> 4218 

<212> DNA 

<213> Artificial Sequence 



10 



<220> 
<223> 



961CL-983 



15 



20 



25 



30 



35 



40 



45 



50 



55 



<400> 162 
atgaaacact 
gcactggcag 
gcctacaaca 
gatgaagacg 
tttaaaggtc 
aaacaaaacg 
aagttagcag 
aacgccttga 
atcgtaaaaa 
gcattcaacg 
aaaaccgcca 
gtaaaagctg 
gcagccgaca 
acgaacaaag 
tctgacagca 
cgcttggctt 
aaaacagtgt 
tccggtctgt 
gacttcaatg 
gcagcagtat 
gccggtcggg 
aatctgcata 
aaacctgcaa 
ggcgaatccg 
aacgaaaatt 
ggtaaagaca 
acggatatcc 
gggcgttccg 
atgaatacga 
gtcaagctgg 
gcaggcactg 
ctcgactatt 
tacggcaacc 
aatgacgcac 
caaaaaggca 
atgtatggag 
gccatgtggt 
attcaaattg 
ctgcagaaat 
caggacatcg 
aaggccatga 
acatccgata 
aaaggcggca 
gaaggcggtt 
ggtgcgctga 
tatctggcag 
cagctggacg 
acggcgatta 
aacagtaccg 
ttcttcacaa 
acagcgggca 
actgcttcgg 
ggcagcaatc 
gagacggttg 
gcaactttcc 
ttcaacagtc 



ttccatccaa 
ccacaaacga 
atggccaaga 
gcacaattac 
tgggtctgaa 
tcgatgccaa 
acactgatgc 
ataaattggg 
ttgatgaaaa 
atatcgccga 
atgaagccaa 
cagaaactgc 
aggccgaagc 
ataatattgc 
aatttgtcag 
ctgctgaaaa 
cagacctgcg 
tccaacctta 
caggcggtac 
cttacgccgg 
atgacgttgc 
ccggagactt 
ttgaagcagg 
tcggcagcat 
acaaaaacta 
ttgaagcttc 
gccacgtaaa 
tggacggcag 
atgatgaaac 
gcgaacgtgg 
ccgacctttt 
ccggcggtga 
tgtcctacca 
aagctcagcc 
ttatcacagt 
aaccgggtac 
gcctgtcggc 
ccggaacatc 
acccgtggat 
gtgcagtcgg 
acggacccgc 
ttgcctactc 
gccaactgca 
cgctggtgtt 
tttataacgg 
ataccgacca 
gcaaaggtac 
tcggcggcaa 
gacgacgtgt 
acatcgaaac 
gtgaaggcga 
cagcggcaca 
tggaaaacct 
aaactgcggc 
gcgcagcggc 
tcgccgctac 



agtactgacc 
cgacgatgtt 
aatcaacggt 
caaaaaagac 
aaaagtcgtg 
agtaaaagct 
cgctttagca 
agaaaatata 
attagaagcc 
ttcattggat 
acagacggcc 
agcaggcaaa 
tgtcgctgca 
taaaaaagca 
aattgatggt 
atccattgcc 
caaagaaacc 
caacgtgggt 
cggtatcggc 
tatcaagaac 
ggttacagac 
tccaaaccca 
ctatacagga 
atcctttccc 
tacggcgtat 
tttcgacgat 
agaaatcgga 
acctgcaggc 
caagaacgaa 
cgtgcgcatc 
ccaaatagcc 
taaaacagac 
catccgtaat 
caacacatat 
cgcaggcgta 
agaaccgctt 
accctatgaa 
cttttccgca 
gagcaacgac 
cgtggacagc 
gtcctttccg 
cttccgtaac 
actgcacggc 
gtacggcaac 
ggcggcatcc 
atccggcgca 
gctgtacaca 
gctgtacatg 
tcccttcctg 
cgacggcggc 
cacgctgtcc 
ttccgcgccc 
gatggtcgaa 
agccgaccgc 
agccgtacag 
cgtctatgcc 



acagccatcc 
aaaaaagctg 
ttcaaagctg 
gcaactgcag 
actaacctga 
gcagaatctg 
gatactgatg 
acgacatttg 
gtggctgata 
gaaaccaaca 
gaagaaacca 
gccgaagctg 
aaagttaccg 
aacagtgccg 
ctgaacgcta 
gatcacgata 
cgccaaggcc 
ggatccggcg 
agcaacagca 
gaaatgtgca 
agggatgcca 
aatgacgcat 
cgcggggtag 
gaactgtatg 
atgcggaagg 
gaggccgtta 
cacatcgatt 
ggtattgcgc 
atgatggttg 
gtcaataaca 
aattcggagg 
gagggtatcc 
aaaaacatgc 
gccctattgc 
gaccgcagtg 
gagtatggct 
gcaagcgtcc 
cccatcgtaa 
aacctgcgta 
aagttcggct 
ttcggcgact 
gacatttcag 
aacaacacct 
aacaaatcgg 
ggcggcagcc 
aacgaaaccg 
cgtttgggca 
tcggcacgcg 
agtgccgcca 
ctgctggctt 
tattatgtcc 
gccggtctga 
ctggatgcct 
acagatatgc 
catgcgaatg 
gacagtaccg 



ttgccacttt 
ccactgtggc 
gagagaccat 
ccgatgttga 
ccaaaaccgt 
aaatagaaaa 
ccgctctgga 
ctgaagagac 
ccgtcgacaa 
ctaaggcaga 
aacaaaacgt 
ccgctggcac 
acatcaaagc 
acgtgtacac 
ctaccgaaaa 
ctcgcctgaa 
ttgcagaaca 
gaggcggcac 
gagcaacaac 
aagacagaag 
aaatcaatgc 
acaagaattt 
aggtaggtat 
gcagaaaaga 
aagcgcctga 
tagagactga 
tggtctccca 
ccgatgcgac 
cagccatccg 
gttttggaac 
agcagtaccg 
gcctgatgca 
ttttcatctt 
cattttatga 
gagaaaagtt 
ccaaccattg 
gtttcacccg 
ccggcacggc 
ccacgttgct 
ggggactgct 
ttaccgccga 
gcacgggcgg 
atacgggcaa 
atatgcgcgt 
tgaacagcga 
tacacatcaa 
aactgctgaa 
gcaagggggc 
aaatcgggca 
ccctcgacag 
gtcgcggcaa 
aacacgccgt 
ccgaatcatc 
cgggcatccg 
ccgccgacgg 
ccgcccatgc 



ctgtagcggc 
cattgctgct 
ctacgacatt 
agccgacgac 
caatgaaaac 
gttaacaacc 
tgcaaccacc 
taagacaaat 
gcatgccgaa 
cgaagccgtc 
cgatgccaaa 
agctaatact 
tgatatcgct 
cagagaagag 
attggacaca 
cggtttggat 
agccgcgctc 
ttctgcgccc 
agcgaaatca 
catgctctgt 
cccccccccg 
gatcaacctc 
cgtcgacaca 
acacggctat 
agacggaggc 
agcaaagccg 
tattattggc 
gctacacata 
caatgcatgg 
aacatcgagg 
ccaagcgttg 
acagagcgat 
ttcgacaggc 
aaaagacgct 
caaacgggaa 
cggaattact 
tacaaacccg 
ggctctgctg 
gacgacggct 
ggatgcgggt 
tacgaaaggt 
cctgatcaaa 
aaccattatc 
cgaaaccaaa 
cggcattgtc 
aggcagtctg 
agtggacggt 
aggctatctc 
ggattattct 
cgtcgaaaaa 
tgcggcacgg 
agaacagggc 
cgcaacaccc 
cccctacggc 
tgtacgcatc 
cgatatgcag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3240 

3300 

3360 
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ggacgccgcc tgaaagccgt atcggacggg ttggaccaca acggcacggg tctgcgcgtc 
atcgcgcaaa cccaacagga cggtggaacg tgggaacagg gcggtgttga aggcaaaatg 
cgcggcagta cccaaaccgt cggcattgcc gcgaaaaccg gcgaaaatac gacagcagcc 
gccacactgg gcatgggacg cagcacatgg agcgaaaaca gtgcaaatgc aaaaaccgac 
agcattagtc tgtttgcagg catacggcac gatgcgggcg atatcggcta tctcaaaggc 
ctgttctcct acggacgcta caaaaacagc atcagccgca gcaccggtgc ggacgaacat 
gcggaaggca gcgtcaacgg cacgctgatg cagctgggcg cactgggcgg tgtcaacgtt 
ccgtttgccg caacgggaga tttgacggtc gaaggcggtc tgcgctacga cctgctcaaa 
caggatgcat tcgccgaaaa aggcagtgct ttgggctgga gcggcaacag cctcactgaa 
ggcacgctgg tcggactcgc gggtctgaag ctgtcgcaac ccttgagcga taaagccgtc 
ctgtttgcaa cggcgggcgt ggaacgcgac ctgaacggac gcgactacac ggtaacgggc 
ggctttaccg gcgcgactgc agcaaccggc aagacggggg cacgcaatat gccgcacacc 
cgtctggttg ccggcctggg cgcggatgtc gaattcggca acggctggaa cggcttggca 
cgttacagct acgccggttc caaacagtac ggcaaccaca gcggacgagt cggcgtaggc 
taccggttct gactcgag 

<210> 163 
<211> 1403 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> 961CL-983 
<400> 163 

Met Lys His Phe Pro Ser Lys val Leu Thr Thr Ala lie Leu Ala Thr 
15 10 15 

Phe cys Ser Gly Ala Leu Ala Ala Thr Asn Asp Asp Asp val Lys Lys 
20 25 30 

Ala Ala Thr Val Ala He Ala Ala Ala Tyr Asn Asn Gly Gin Glu He 
35 40 45 

Asn Gly Phe Lys Ala Gly Glu Thr lie Tyr Asp lie Asp Glu Asp Gly 
50 55 60 

Thr lie Thr Lys Lys Asp Ala Thr Ala Ala Asp val Glu Ala Asp Asp 
65 70 75 80 

Phe Lys Gly Leu Gly Leu Lys Lys Val Val Thr Asn Leu Thr Lys Thr 
85 90 95 

val Asn Glu Asn Lys Gin Asn val Asp Ala Lys val Lys Ala Ala Glu 
100 105 110 

Ser Glu lie Glu Lys Leu Thr Thr Lys Leu Ala Asp Thr Asp Ala Ala 
115 120 125 

Leu Ala Asp Thr Asp Ala Ala Leu Asp Ala Thr Thr Asn Ala Leu Asn 
130 135 140 

Lys Leu Gly Glu Asn lie Thr Thr Phe Ala Glu Glu Thr Lys Thr Asn 
145 150 155 ' 160 

lie val Lys lie Asp Glu Lys Leu Glu Ala val Ala Asp Thr val Asp 
165 170 175 

Lys His Ala Glu Ala Phe Asn Asp lie Ala Asp Ser Leu Asp Glu Thr 
180 185 190 

Asn Thr Lys Ala Asp Glu Ala val Lys Thr Ala Asn Glu Ala Lys Gin 
195 200 205 

Thr Ala Glu Glu Thr Lys Gin Asn Val Asp Ala Lys Val Lys Ala Ala 
210 215 220 

Glu Thr Ala Ala Gly Lys Ala Glu Ala Ala Ala Gly Thr Ala Asn Thr 
225 230 235 240 



232 



EP 1 790 660 A2 



Ala Ala Asp Lys Ala Glu Ala val Ala Ala Lys val Thr Asp lie Lys 
245 250 255 

Ala Asp lie Ala Thr Asn Lys Asp Asn lie Ala Lys Lys Ala Asn Ser 
260 265 270 

Ala Asp val Tyr Thr Arg Glu Glu Ser Asp Ser Lys Phe Val Arg lie 
275 280 285 

Asp Gly Leu Asn Ala Thr Thr Glu Lys Leu Asp Thr Arg Leu Ala Ser 
290 295 300 

Ala Glu Lys Ser lie Ala Asp His Asp Thr Arg Leu Asn Gly Leu Asp 
305 310 315 320 

Lys Thr val Ser Asp Leu Arg Lys Glu Thr Arg Gin Gly Leu Ala Glu 

325 ^ 330 335 

Gin Ala Ala Leu Ser Gly Leu Phe Gin Pro Tyr Asn Val Gly Gly Ser 
340 345 350 

Gly Gly Gly Gly Thr Ser Ala Pro Asp Phe Asn Ala Gly Gly Thr Gly 
355 360 365 

lie Gly Ser Asn ser Arg Ala Thr Thr Ala Lys ser Ala Ala val Ser 
370 375 380 

Tyr Ala Gly lie Lys Asn Glu Met Cys Lys Asp Arg ser Met Leu Cys 
385 390 395 ^ 400 

Ala Gly Arg Asp Asp val Ala val Thr Asp Arg Asp Ala Lys lie Asn 
405 410 415 

Ala Pro Pro Pro Asn Leu His Thr Gly Asp Phe Pro Asn Pro Asn Asp 
420 425 430 

Ala Tyr Lys Asn Leu lie Asn Leu Lys Pro Ala lie Glu Ala Gly Tyr 
435 440 445 

Thr Gly Arg Gly val Glu val Gly lie Val Asp Thr Gly Glu ser val 
450 455 460 

Gly ser lie ser Phe Pro Glu Leu Tyr Gly Arg Lys Glu His Gly Tyr 
465 470 475 480 

Asn Glu Asn Tyr Lys Asn Tyr Thr Ala Tyr Met Arg Lys Glu Ala Pro 
485 490 495 

Glu Asp Gly Gly Gly Lys Asp lie Glu Ala Ser Phe Asp Asp Glu Ala 
500 505 510 

val lie Glu Thr Glu Ala Lys Pro Thr Asp lie Arg His val Lys Glu 

515 520 525 

lie Gly His lie Asp Leu val Ser His lie lie Gly Gly Arg ser val 
530 535 540 

Asp Gly Arg Pro Ala Gly Gly lie Ala Pro Asp Ala Thr Leu His lie 
545 550 555 560 

Met Asn Thr Asn Asp Glu Thr Lys Asn Glu Met Met val Ala Ala lie 
565 570 575 

Arg Asn Ala Trp val Lys Leu Gly Glu Arg Gly Val Arg lie Val Asn 
580 585 590 

Asn Ser Phe Gly Thr Thr Ser Arg Ala Gly Thr Ala Asp Leu Phe Gin 
595 600 605 
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lie Ala 
610 

Gly Gly 
625 

Tyr Gly 

Phe Ser 

Leu pro 

Gly val 
690 

Pro Gly 
705 

Ala Met 
Arg Thr 
val Thr 



Asn Asp 
770 

Ala val 

785 



Asn Ser Glu Glu 
Asp Lys 
Asn Leu 



Thr Asp 
630 



Thr Gly 
660 

Phe Tyr 
675 

Asp Arg 
Thr Glu 
Trp Cys 



Ser Tyr 
645 

Asn Asp 
Glu Lys 
Ser Gly 



Pro Leu 
710 

Leu Ser 
725 



lie Gin 
Ala Ala 
Asn Leu Arg Thr 



Asn Pro 
740 

Gly Thr 
755 



Gly val 



Lys Ala Met Asn 
Asp Thr 
Ser Gly 



Lys Gly 
820 



His Gly 
850 

Leu Val 
865 

Gly Ala 
Asp Gly 
Thr val 



Thr Gly 
835 

Asn Asn 



Leu Tyr 
Leu lie 



lie val 
900 

His lie 
915 



Tyr Thr 
930 



Arg Leu 
Lys Leu 
Asn Ser Thr Gly 



Gly Gly 
945 



Asp Ser 
790 

Gly Pro 
805 

Thr Ser 

Gly Leu 

Thr Tyr 

Gly Asn 
870 

Tyr Asn 
885 

Tyr Leu 
Lys Gly 
Gly Lys 



Tyr Met 
950 

Arg Arg 
965 



Gin Tyr 
615 

Glu Gly 

His lie 

Ala Gin 

Asp Ala 
680 

Glu Lys 
695 

Glu Tyr 

Ala Pro 

lie Ala 

Leu Leu 
760 

Thr Leu 
775 

Lys Phe 

Ala Ser 

Asp lie 

lie Lys 
840 

Thr Gly 
855 

Asn Lys 

Gly Ala 

Ala Asp 

Ser Leu 
920 

Leu Leu 

935 

Ser Ala 
val Pro 



Arg Gin 
lie Arg 



Arg Asn 
650 

Ala Gin 
665 



Ala Leu 
620 



Leu Asp Tyr Ser 
Gin Gin 
Lys Asn Met Leu 



Leu Met 
635 



Ser Asp 
640 



Pro Asn 



Gin Lys Gly lie 
Phe Lys 
Gly Ser 



Arg Glu 
700 



Tyr Glu 
730 

Gly Thr 
745 

Leu Gin 

Leu Thr 

Gly Trp 

Phe Pro 
810 

Ala Tyr 
825 

Lys Gly 

Lys Thr 

ser Asp 

Ala ser 
890 

Thr Asp 
905 

Gin Leu 
Lys val 
Arg Gly 



Phe Leu 

970 



Asn His 
715 

Ala Ser 

Ser Phe 

Lys Tyr 

Thr Ala 
780 

Gly Leu 

795 

Phe Gly 
Ser Phe 
Gly Ser 



lie lie 
860 

Met Arg 
875 



Thr Tyr 
670 

lie Thr 
685 

Met Tyr 
cys Gly 
Val Arg 



ser Ala 
750 

Pro Trp 
765 



Phe lie 
655 

Ala Leu 
val Ala 
Gly Glu 



lie Thr 
720 

Phe Thr 

735 

Pro lie 



Met Ser 



Gin Asp lie Gly 
Leu Asp 
Asp Phe 



Ala Gly 
800 



Arg Asn 
830 

Gin Leu 

845 

Glu Gly 
Val Glu 



Gly Gly ser Leu 
Gin ser 
Asp Gly 



Gly Ala 
910 



Asp Gly 
940 

Lys Gly 
955 



Lys Gly 
925 

Thr Ala 



Thr Ala 
815 

Asp lie 

Gin Leu 

Gly Ser 

Thr Lys 
880 

Asn ser 
895 

Asn Glu 
Thr Leu 
lie lie 



Ala Gly Tyr Leu 
960 



Ser Ala Ala Lys lie Gly 
975 
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Gin Asp Tyr ser Phe Phe Thr Asn lie Glu Thr Asp Gly Gly Leu Leu 
980 985 990 

Ala Ser Leu Asp Ser Val Glu Lys Thr Ala Gly Ser Glu Gly Asp Thr 
995 1000 1005 

Leu Ser Tyr Tyr Val Arg Arg Gly Asn Ala Ala Arg Thr Ala Ser Ala 
1010 1015 1020 

Ala Ala His Ser Ala Pro Ala Gly Leu Lys His Ala val Glu Gin Gly 
1025 1030 1035 1040 

Gly Ser Asn Leu Glu Asn Leu Met val Glu Leu Asp Ala Ser Glu Ser 
1045 1050 1055 

Ser Ala Thr Pro Glu Thr Val Glu Thr Ala Ala Ala Asp Arg Thr Asp 
1060 1065 1070 

Met Pro Gly lie Arg Pro Tyr Gly Ala Thr Phe Arg Ala Ala Ala Ala 
1075 ~ 1080 1085 

val Gin His Ala Asn Ala Ala Asp Gly val Arg lie Phe Asn Ser Leu 
1090 1095 1100 

Ala Ala Thr Val Tyr Ala Asp Ser Thr Ala Ala His Ala Asp Met Gin 
1105 1110 1115 1120 

Gly Arg Arg Leu Lys Ala Val Ser Asp Gly Leu Asp His Asn Gly Thr 
1125 1130 1135 

Gly Leu Arg Val lie Ala Gin Thr Gin Gin Asp Gly Gly Thr Trp Glu 
1140 1145 1150 

Gin Gly Gly val Glu Gly Lys Met Arg Gly Ser Thr Gin Thr Val Gly 
1155 1160 1165 

He Ala Ala Lys Thr Gly Glu Asn Thr Thr Ala Ala Ala Thr Leu Gly 
1170 1175 1180 

Met Gly Arg ser Thr Trp ser Glu Asn Ser Ala Asn Ala Lys Thr Asp 
1185 1190 1195 1200 

Ser lie Ser Leu Phe Ala Gly lie Arg His Asp Ala Gly Asp lie Gly 
1205 1210 1215 

Tyr Leu Lys Gly Leu Phe Ser Tyr Gly Arg Tyr Lys Asn Ser lie Ser 
1220 1225 1230 

Arg Ser Thr Gly Ala Asp Glu His Ala Glu Gly Ser val Asn Gly Thr 
1235 1240 1245 

Leu Met Gin Leu Gly Ala Leu Gly Gly Val Asn val Pro Phe Ala Ala 
1250 1255 1260 

Thr Gly Asp Leu Thr Val Glu Gly Gly Leu Arg Tyr Asp Leu Leu Lys 
1265 1270 1275 1280 

Gin Asp Ala Phe Ala Glu Lys Gly Ser Ala Leu Gly Trp Ser Gly Asn 
1285 1290 1295 

Ser Leu Thr Glu Gly Thr Leu Val Gly Leu Ala Gly Leu Lys Leu Ser 
1300 1305 1310 

Gin Pro Leu Ser Asp Lys Ala Val Leu Phe Ala Thr Ala Gly Val Glu 
1315 1320 1325 

Arg Asp Leu Asn Gly Arg Asp Tyr Thr val Thr Gly Gly Phe Thr Gly 
1330 1335 1340 



235 



EP 1 790 660 A2 



Ala Thr Ala Ala Thr Gly Lys Thr Gly Ala Arg Asn Met Pro His Thr 
1345 1350 1355 1360 

Arg Leu val Ala Gly Leu Gly Ala Asp val Glu Phe Gly Asn Gly Trp 
1365 1370 1375 

Asn Gly Leu Ala Arg Tyr Ser Tyr Ala Gly Ser Lys Gin Tyr Gly Asn 
1380 1385 1390 

His Ser Gly Arg Val Gly Val Gly Tyr Arg Phe 
1395 1400 

<210> 164 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 164 

cgcggatccg ctagcaaaac aaccgacaaa egg 

<210> 165 

<211> 27 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 165 

cccgctcgag ttaccagegg tagecta 

<210> 166 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 166 

etagctageg gacacactta ttteggcate 

<210> 167 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 167 

cccgctcgag ttaccagegg tagectaatt tg 

<210> 168 

<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 168 
cccgctcgag 

<210> 169 
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<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> 01 i gonucl eoti de 

<400> 169 

cgcggatccc atatgaaaac cttcttcaaa acc 

<210> 170 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> 01 i gonucl eoti de 

<400> 170 

cccgctcgag ttatttggct gcgccttc 

<210> 171 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eotide 

<400> 171 

gcggcattaa tatgttgaga aaattgttga aatgg 

<210> 172 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 172 

gcggcctcga gttatttttt caaaatatat ttgc 

<210> 173 

<211> 33 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 173 

gcggccatat gttacctaac cgtttcaaaa tgt 

<210> 174 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eotide 

<400> 174 

gcggcctcga gttatttccg aggttttcgg g 

<210> 175 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> Ol i gonucl eoti de 

<400> 175 

cgcggatccc atatgacacg cttcaaatat tc 

<210> 176 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 176 

cccgctcgag ttatttaaac cgataggtaa a 

<210> 177 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 177 

cgcggatccc atatgggcag ggaagaaccg c 

<210> 178 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucleotide 

<400> 178 

gcccaagctt atcgatggaa tagccgcg 

<210> 179 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol igonucleotide 

<400> 179 

cgcggatccg ctagcaacgg tttggatgcc eg 

<210> 180 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucleotide 

<400> 180 

cccgctcgag tttgtctaag ttcctgatat 

<210> 181 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<2 2 3> Ol i gonucl eot i de 
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<400> 181 

cccgctcgag attcccacct gccatc 

<210> 182 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 182 

cgcggatccg ctagcatgaa tttgcctatt caaaaat 

<210> 183 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 183 

cccgctcgag ttaattccca cctgccatc 

<210> 184 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 184 

cgcggatccg ctagcatgaa tttgcctatt caaaaat 

<210> 185 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 185 

cccgctcgag ttggacgatg cccgcga 

<210> 186 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 186 

cgcggatccg ctagcatgaa tttgcctatt caaaaat 

<210> 187 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 187 

cccgctcgag ttattggacg atgcccgc 
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<210> 188 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

<400> 188 

cgcggatccc atatgtatcg caaactgatt gc 

<210> 189 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Ol i gonucl eoti de 

<400> 189 

cccgctcgag ctaatcgatg gaatagcc 

<210> 190 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 190 

cgcggatccc atatgaaaca gacagtcaaa tg 

<210> 191 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 191 

cccgctcgag tcaataaccc gccttcag 

<210> 192 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 192 

cgcggatccc atatgttacg tttgactgct ttagccgtat gcacc 

<210> 193 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 193 

cccgctcgag ttattttgcc gcgttaaaag cgtcggcaac 

<210> 194 

<211> 32 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 194 

cgcggatccc atatgaacaa aatataccgc at 

<210> 195 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 195 

cccgctcgag ttaccactga taaccgac 

<210> 196 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 196 

cgcggatccc atatgaccga tgacgacgat ttat 

<210> 197 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 197 

gcccaagctt ccactgataa ccgacaga 

<210> 198 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oli gonucl eoti de 

<400> 198 

cgcggatccc atatgaacaa aatataccgc at 

<210> 199 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> ol i gonucl eoti de 

<400> 199 

gcccaagctt ttaccactga taaccgac 

<210> 200 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> 01 i gonucl eoti de 

<400> 200 

gggaattcca tatgggcatt tcccgcaaaa tatc 

<210> 201 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 201 

cccgctcgag ttatttactc ctataacgag gtctcttaac 

<210> 202 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucl eotide 

<400> 202 

gggaattcca tatgtcagat ttggcaaacg attctt 

<210> 203 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 203 

cccgctcgag ttatttactc ctataacgag gtctcttaac 

<210> 204 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucl eoti de 

<400> 204 

gggaattcca tatgggcatt tcccgcaaaa tatc 

<210> 205 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 205 

cccgctcgag ttacgtatca tatttcacgt gc 

<210> 206 

<211> 35 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucl eotide 
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<400> 206 

gggaattcca tatgcacgtg aaatatgata cgaag 

<210> 207 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 207 

cccgctcgag tttactccta taacgaggtc tcttaac 

<210> 208 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 208 

gggaattcca tatgtcagat ttggcaaacg attctt 

<210> 209 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 209 

cccgctcgag cgtatcatat ttcacgtgc 

<210> 210 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 210 

gggaattcca tatgtcagat ttggcaaacg attctt 

<210> 211 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 211 

cccgctcgag tttactccta taacgaggtc tcttaac 

<210> 212 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 212 

cgcggatccc atatgcaaaa tgcgttcaaa atccc 
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<210> 213 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 213 

cgcggatccc atatgaacaa aatataccgc at 

<210> 214 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 214 

cccgctcgag tttgctttcg atagaacgg 

<210> 215 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 215 

gcggccatat ggtcataaaa tatacaaatt tgaa 

<210> 216 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 216 

gcggcctcga gttagcctga gacctttgca aatt 

<210> 217 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 217 

gcggccatat gaaacagaaa aaaaccgctg 

<210> 218 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 218 

gcggcctcga gttacggttt gacaccgttt tc 

<210> 219 

<211> 30 

<212> DNA 
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<213> Artificial sequence 
<220> 

<223> oligonucleotide 

<400> 219 

cgcggatccc atatgaaaac cctgctcctc 

<210> 220 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 220 

cccgctcgag ttatcctcct ttgcggc 

<210> 221 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 221 

gcggccatat ggcaaaaatg atgaaatggg 

<210> 222 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 222 

gcggcctcga gttatcggcg cggcgggcc 

<210> 223 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 223 

gcggccatat gaaaaaatcc tccctcatca 

<210> 224 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 224 

gcggcctcga gttatttgcc gccgtttttg gc 

<210> 225 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<22 3> ol i gonucl eoti de 

<400> 225 

gcggccatat ggcccctgcc gacgcggtaa g 

<210> 226 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 226 

gcggcctcga gtttgccgcc gtttttggct ttc 

<210> 227 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 227 

gcggccatat gaaacacata ctccccctga 

<210> 228 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 228 

gcggcctcga gttattcgcc tacggttttt tg 

<210> 229 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Ol i gonucl eoti de 

<400> 229 

gcggccatat gatttacatc gtactgtttc 

<210> 230 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 230 

gcggcctcga gttaggagaa caggcgcaat gc 

<210> 231 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> ol i gonucl eoti de 

<400> 231 
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gcggccatat gtacaacatg tatcaggaaa ac 

<210> 232 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 232 

gcggcctcga gggagaacag gcgcaatgcg g 

<210> 233 

<211> 29 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 233 

cgcggatccg ctagctgcgg cacggcggg 

<210> 234 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 234 

cccgctcgag ataacggtat gccgccag 

<210> 235 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 235 

cgcggatccc atatggaatc aacactttca c 

<210> 236 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 236 

cccgctcgag ttacacgcgg ttgctgt 

<210> 237 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 237 

cgcggatccc atatgaacaa cagacatttt g 

<210> 238 
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<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 238 

cccgctcgag ttacctgtcc ggtaaaag 

<210> 239 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 239 

cgcggatccg ctagcaccgt catcaaacag gaa 

<210> 240 

<211> 27 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 240 

cccgctcgag tcaagattcg acgggga 

<210> 241 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 241 

cgcggatccc atatgtccgc aaacgaatac g 

<210> 242 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 242 

cccgctcgag tcagtgttct gccagttt 

<210> 243 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 243 

cgcggatccc atatgccgtc tgaaacacg 

<210> 244 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> Ol i gonucl eoti de 

<400> 244 

cccgctcgag ttagcggagc agtttttc 

<210> 245 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 245 

cgcggatccc atatgaccgc catcagcc 

<210> 246 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 246 

cccgctcgag ttaaagccgg gtaacgc 

<210> 247 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eoti de 

<400> 247 

gcggccatat ggaaacacag ctttacatcg g 

<210> 248 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 248 

gcggcctcga gtcaataata atatcccgcg 

<210> 249 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> ol i gonucl eoti de 

<400> 249 

gcggccatat gattaaaatc cgcaatatcc 

<210> 250 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 
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<400> 250 

gcggcctcga gttaaatctt ggtagattgg atttgg 

<210> 251 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> 01 i gonucl eoti de 

<400> 251 

gcggccatat gactgacaac gcactgctcc 

<210> 252 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 252 

gcggcctcga gtcagaccgc gttgtcgaaa c 

<210> 253 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 253 

cgcggatccc atatggcgtt aaaaacatca aa 

<210> 254 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 254 

cccgctcgag tcagcccttc atacagc 

<210> 255 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 255 

gcggcattaa tggcacaaac tacactcaaa cc 

<210> 256 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 256 

gcggcctcga gttaaaactt cacgttcacg ccg 
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<210> 257 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 257 

gcggcattaa tgcatgaaac tgagcaatcg gtgg 

<210> 258 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 258 

gcggcctcga gaaacttcac gttcacgccg ccggtaaa 

<210> 259 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 259 

cgcggatccc atatgggcaa atccgaaaat acg 

<210> 260 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 260 

cccgctcgag ataatggcgg cggcgg 

<210> 261 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 261 

cgcggatccc atatgtttcc ccccgacaa 

<210> 262 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 262 

cccgctcgag tcattctgta aaaaaagtat g 

<210> 263 

<211> 32 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 263 

cgcggatccc atatgcttca aagcgacagc ag 

<210> 264 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 264 

cccgctcgag ttcggatttt tgcgtactc 

<210> 265 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 265 

cgcggatccc atatggcaat ggcagaaaac g 

<210> 266 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eotide 

<400> 266 

cccgctcgag ctatacaatc cgtgccg 

<210> 267 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eotide 

<400> 267 

cgcggatccc atatggattc ttttttcaaa cc 

<210> 268 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 268 

cccgctcgag tcagttcaga aagcggg 

<210> 269 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<22 3> 01 i gonucl eoti de 

<400> 269 

cgcggatccc atatgaaacc tttgatttta gg 

<210> 270 

<211> 28 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 270 

cccgctcgag ttatttgggc tgctcttc 

<210> 271 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 271 

cgcggatccc atatggtaat cgtctggttg 

<210> 272 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 272 

cccgctcgag ctacgacttg gttaccg 

<210> 273 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 273 

gcggccatat gagacgtaaa atgctaaagc tac 

<210> 274 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 274 

gcggcctcga gtcaaagtgt tctgtttgcg c 

<210> 275 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 
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<400> 275 

gccgccatat gttgacttta acccgaaaaa 

<210> 276 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 276 

gccgcctcga ggccggcggt caataccgcc cgaa 

<210> 277 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 277 

cgcggatccc atatggcgca atgcgatttg ac 

<210> 278 

<211> 27 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 278 

cccgctcgag .ttcggcggta aatgccg 

<210> 279 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 279 

gcggccatat ggcggggccg atttttgt 

<210> 280 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> oli gonucl eoti de 

<400> 280 

gcggcctcga gttatttgct ttcagtatta ttg 

<210> 281 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 281 

gcggccatat gaactttgct ttatccgtca 
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<210> 282 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> 01 i gonucl eoti de 

<400> 282 

gcggcctcga gttaacggca gtatttgttt ac 

<210> 283 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 283 

cgcggatccc atatgggttt gcgcttcggg c 

<210> 284 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 284 

gcccaagctt ttttcctttg ccgtttccg 

<210> 285 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 285 

cgcggatccc atatggccga cctttccgaa aa 

<210> 286 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 286 

cccgctcgag gaagcgcgtt cccaagc 

<210> 287 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 287 

cgcggatccc atatgcacga cacccgtac 

<210> 288 

<211> 28 

<212> DNA 
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<213> Artificial sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 288 

cccgctcgag ttagaagcgc gttcccaa 

<210> 289 

<211> 34 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 289 

ctagctagct ttaaacgcag cgtaatcgca atgg 

<210> 290 

<211> 31 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti deucl eoti de 

<400> 290 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 291 

<211> 25 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 291 

ctagctagcg ggggcggcgg tggcg 

<210> 292 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 292 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 293 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 293 

ctagctagcg ctcatcctcg ccgcctgcgg gggcggcggt 

<210> 294 

<211> 31 

<212> DNA 

<213> Artificial Sequence 

<220> 
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<223> 01 i gonucl eoti de 

<400> 294 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 295 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 295 

cggggatccg ggggcggcgg tggcg 

<210> 296 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 296 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 297 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 297 

ctagctagcg ggggcggcgg tggcg 

<210> 298 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 298 

cccgctcgag atcctgctct tttttgcc 

<210> 299 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Ol i gonucl eot i de 

<400> 299 

ctagctagct gcgggggcgg cggtggcg 



<210> 


300 


<211> 


28 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Oligonucleotide 


<400> 


300 
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cccgctcgag atcctgctct tttttgcc 

<210> 301 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eot i de 

<400> 301 

cgcggatccg ctagccccga tgttaaatcg gc 

<210> 302 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 302 

cgcggatccg ctagccaaga tatggcggca gt 

<210> 303 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 303 

cgcggatccg ctagcgccga atccgcaaat ca 

<210> 304 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 304 

cgcgctagcg gaagggttga tttggctaat gg 

<210> 305 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 305 

cgcgctagcg gaagggttga tttggctaat gg 

<210> 306 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 306 

cgccatatgt ttaaacgcag cgtaatcgc 

<210> 307 
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<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 307 

cccgctcgag aaaattgcta ccgccattcg cagg 

<210> 308 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleoti de 

<400> 308 

cgccatatgg gaagggttga tttggctaat gg 

<210> 309 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 309 

cccgctcgag cttgtcttta taaatgatga catatttg 

<210> 310 

<211> 40 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 310 

cccgctcgag tttataaaag ataatatatt gattgattcc 

<210> 311 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 311 

cgcgctagca tgccgctgat tcccgtcaat c 

<210> 312 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 312 

ctagctagcg ggggcggcgg tggcg 

<210> 313 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 313 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 314 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 314 

cgcggatccg ctagccccga tgttaaatcg gc 

<210> 315 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 315 

cccgctcgag atcctgctct tttttgcc 

<210> 316 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 316 

cgcggatccg ctagccccga tgttaaatcg gc 

<210> 317 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 317 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 318 

<211> 87 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 318 

cgcggatccg ctagctttga acgcagtgtg attgcaatgg cttgtatttt tgccctttca 
gcctgttcgc ccgatgttaa atcggcg 

<210> 319 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Ol i gonucl eoti de 

<400> 319 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 320 

<211> 90 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 320 

cgcggatccg ctagcaaaac cttcttcaaa accctttccg ccgccgcact cgcgctcatc 
ctcgccgcct gctcgcccga tgttaaatcg 

<210> 321 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 321 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 322 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 322 

cgcggatccc atatgaaaac caagttaatc aaa 

<210> 323 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 323 

cccgctcgag ttattgattt ttgcggatga 

<210> 324 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 324 

cgcggatccc atatgttaaa tcgggtattt tatc 

<210> 325 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 
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<400> 325 

cccgctcgag ttaatccgcc attccctg 

<210> 326 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> 01 i gonucl eoti de 

<400> 326 

gcggccatat gaaattacaa caattggctg 

<210> 327 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 327 

gcggcctcga gttaccttac gtttttcaaa g 

<210> 328 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 328 

cgcggatccc atatgcaagc acggctgct 

<210> 329 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 329 

cccgctcgag tcaaggttgt ccttgtcta 

<210> 330 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 330 

cgcggatccc atatgatgaa accgcacaac 

<210> 331 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> ol i gonucl eoti de 

<400> 331 

cccgctcgag tcagttgctc aacacgtc 
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<210> 332 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 332 

cgcggatccc atatggtaga cgcgcttaag ca 

<210> 333 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 333 

cccgctcgag agctgcatgg cggcg 

<210> 334 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 334 

cgcggatccc atatggcacg gtcgttatac 

<210> 335 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 335 

cccgctcgag ctaccgcgca ttcctg 

<210> 336 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 336 

gcggccatat ggaatttttc attatcttgt t 

<210> 337 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 337 

gcggcctcga gttatttggc ggttttgctg c 

<210> 338 

<211> 32 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 338 

gcggccatat gaagtatgtc cggttatttt tc 

<210> 339 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucl eoti de 

<400> 339 

gcggcctcga gttatcggct tgtgcaacgg 

<210> 340 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 340 

cgcggatccg ctagctccgg cagcaaaacc ga 

<210> 341 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol igonucleotide 

<400> 341 

gcccaagctt acgcagttcg gaatggag 

<210> 342 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 342 

gccgccatat gttgaatatt aaactgaaaa ccttg 

<210> 343 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 

<400> 343 

gccgcctcga gttatttctg atgccttttc cc 

<210> 344 

<211> 29 

<212> DNA 

<213> Artificial sequence 

<220> 
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<22 3> Ol i gonucl eoti de 

<400> 344 

gccgccatat ggacaataag accaaactg 

<210> 345 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eot i de 

<400> 345 

gccgcctcga gttaacggtg cggacgtttc 

<210> 346 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 346 

cgcggatccc atatgaacaa actgtttctt ac 

<210> 347 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eoti de 

<400> 347 

cccgctcgag tcattccgcc ttcagaaa 

<210> 348 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eot i de 

<400> 348 

cgcggatccc atatgcaagg tatcgttgcc gacaaatccg cacct 

<210> 349 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 349 

cccgctcgag agctaattgt gcttggtttg cagataggag tt 

<210> 350 

<211> 52 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 350 
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cgcggatccc atatgaaccg caccctgtac aaagttgtat ttaacaaaca tc 

<210> 351 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 351 

cccgctcgag ttaagctaat tgtgcttggt ttgcagatag gagtt 

<210> 352 

<211> 46 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 352 

cgcggatccc atatgacggg agaaaatcat gcggtttcac ttcatg 

<210> 353 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 353 

cccgctcgag agctaattgt gcttggtttg cagataggag tt 

<210> 354 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucl eotide 

<400> 354 

cgcggatccc atatggtttc agacggccta tacaaccaac atggtgaaat t 

<210> 355 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucl eotide 

<400> 355 

cccgctcgag gcggtaactg ccgcttgcac tgaatccgta a 

<210> 356 

<211> 46 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 356 

cgcggatccc atatgacggg agaaaatcat gcggtttcac ttcatg 

<210> 357 
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<211> 41 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 357 

cccgctcgag gcggtaactg ccgcttgcac tgaatccgta a 

<210> 358 

<211> 49 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 358 

cgcggatccc atatgcaaag caaagtcaaa gcagaccatg cctccgtaa 

<210> 359 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 359 

cccgctcgag tcttttcctt tcaattataa ctttagtagg ttcaattttg gtcccc 

<210> 360 

<211> 51 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 360 

cgcggatccc atatggtttc agacggccta tacaaccaac atggtgaaat t 

<210> 361 

<211> 56 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 361 

cccgctcgag tcttttcctt tcaattataa ctttagtagg ttcaattttg gtcccc 

<210> 362 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 362 

gcggccatat gacccgtttg acccgcg 

<210> 363 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 363 

gcggcctcga gtcagcgggc gttcatttct t 

<210> 364 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 364 

cgcggatccc atatgaacac cattttcaaa ate 

<210> 365 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 365 

cccgctcgag ttaatttact tttttgatgt eg 

<210> 366 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 366 

geggecatat ggattcgccc aaggtegg 

<210> 367 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 367 

gcggcctcga gctacacttc ccccgaagtg g 

<210> 368 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 368 

cgcggatccc atatgatagt tgaccaaagc c 

<210> 369 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 
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<400> 369 

cccgctcgag ttatttttcc gatttttcgg 

<210> 370 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 370 

gcggccatat gcttgaactg aacggact 

<210> 371 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 371 

gcggcctcga gtcagcggaa gcggacgatt 

<210> 372 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucl eotide 

<400> 372 

cgcggatccc atatgtccaa actcaaaacc atcg 

<210> 373 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 373 

cccgctcgag gcttccaatc agtttgacc 

<210> 374 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 374 

gcggccatat gagcgcaatc gttgatattt tc 

<210> 375 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 375 

gcggcctcga gttatttgcc cagttggtag aatg 
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<210> 376 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 376 

gcggccatat ggtgatacat ccgcactact tc 

<210> 377 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 377 

gcggcctcga gtcaaaatcg agttttacac ca 

<210> 378 

<211> 31 

<212> DNA 

<213> Arti f i ci al Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 378 

gcggccatat gaccatctat ttcaaaaacg g 

<210> 379 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 379 

gcggcctcga gtcagccgat gtttagcgtc catt 

<210> 380 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 380 

cgcggatccc atatgagcag cggagggggt g 

<210> 381 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 381 

cccgctcgag ttgcttggcg gcaaggc 

<210> 382 

<211> 31 



270 



EP 1 790 660 A2 



<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> 01 i gonucl eoti de 

<400> 382 

cgcggatccc atatggtcgc cgccgacatc g 

<210> 383 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 383 

cccgctcgag ttgcttggcg gcaaggc 

<210> 384 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 384 

cgcggatccc atatgggcgg ttcggaaggc g 

<210> 385 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 385 

cccgctcgag ttgaacactg atgtcttttc cga 

<210> 386 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 386 

cgcggatccg ctagcaaact gtcgttggtg ttaac 

<210> 387 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 387 

cccgctcgag ttgacccgct ccacgg 

<210> 388 

<211> 31 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> 01 i gonucl eoti de 

<400> 388 

gccgccatat ggcggacttg gcgcaagacc c 

<210> 389 

<211> 39 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> ol i gonucl eoti de 

<400> 389 

gccgcctcga gatctcctaa acctgtttta acaatgccg 

<210> 390 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 390 

gccgccatat ggcggacttg gcgcaagacc c 

<210> 391 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucl eotide 

<400> 391 

gcggcctcga gctccatgct gttgccccag c 

<210> 392 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 392 

gccgccatat ggcggacttg gcgcaagacc c 

<210> 393 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 393 

gcggcctcga gaaaatcccc gctaaccgca g 

<210> 394 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
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<400> 394 

cgcggatccc atatgagcag cggagggggt g 

<210> 395 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 395 

cccgctcgag ttgcttggcg gcaaggc 

<210> 396 

<211> 31 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 396 

cgcggatccc atatggtcgc cgccgacatc g 

<210> 397 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 397 

cccgctcgag ttgcttggcg gcaaggc 

<210> 398 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 398 

cgcggatccc atatggacgg tgttgtgcct gtt 

<210> 399 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oli gonucl eoti de 

<400> 399 

cccgctcgag cttacggatc aaattgacg 

<210> 400 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 400 

cgcggatccc atatgggcag ccaatctgaa gaa 
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<210> 401 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 401 

cccgctcgag ctcagctttt gccgtcaa 

<210> 402 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 402 

cgcggatccg ctagctactc atccattgtc cgc 

<210> 403 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 403 

cccgctcgag ccagttgtag cctattttg 

<210> 404 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<2 23> Ol i gonucl eoti de 

<400> 404 

cgcggatccg ctagcatgcg cttcacacac ac 

<210> 405 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 405 

cccgctcgag ttaccagttg tagcctattt 

<210> 406 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eotide 

<400> 406 

gccgccatat ggcacaaacg gaaggtttgg aa 

<210> 407 

<211> 36 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 407 

gccgcctcga gaaaactgta acgcaggttt gccgtc 

<210> 408 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 408 

gcggccatat ggaagaaaca ccgcgcgaac eg 

<210> 409 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> ol i gonucl eoti de 

<400> 409 

gcggcctcga ggaacgtttt attaaactcg ac 

<210> 410 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 410 

gcggccatat gagaaaaccg accgataccc ta 

<210> 411 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 411 

gcggcctcga gtcaacgcca ctgccagcgg ttg 

<210> 412 

<211> 48 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 412 

cgcggatccc atatgaagaa gaacatattg gaattttggg teggactg 

<210> 413 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Ol i gonucl eoti de 

<400> 413 

cccgctcgag ttattcggcg gctttttccg cattgccg 

<210> 414 

<211> 103 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 414 

gggaattcca tatgaaaaag acagctatcg cgattgcagt ggcactggct ggtttcgcta 
ccgtagcgca ggccgctagc gctttccgcg tggccggcgg tgc 

<210> 415 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 415 

cccgctcgag ttattcggcg gctttttccg cattgccg 

<210> 416 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 416 

catgccatgg ctttccgcgt ggccggcggt gc 

<210> 417 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 417 

cccgctcgag ttattcggcg gctttttccg cattgccg 

<210> 418 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 418 

cgcggatccc atatgtttgc cgaaacccgc c 

<210> 419 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 
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<400> 419 

cccgctcgag aggttgtgtt ccaggttg 

<210> 420 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 420 

cgcggatccc atatgaaaaa aaccgcctat g 

<210> 421 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 421 

cccgctcgag ttaaggttgt gttccagg 

<210> 422 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 422 

cgcggatccc atatgaaaaa atacctattc cgc 

<210> 423 

<211> 27 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 423 

cccgctcgag ttacgggcgg tattcgg 

<210> 424 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 424 

cgcggatccc atatgcaaag caagagcatc caaa 

<210> 425 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 425 

cccgctcgag ttacgggcgg tattcgg 
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<210> 426 

<211> 86 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

<400> 426 

gggaattcca tatgaaaacc ttcttcaaaa ccctttccgc cgccgcgcta gcgctcatcc 
tcgccgcctg ccaaagcaag agcatc 

<210> 427 

<211> 38 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 

<400> 427 

cccgctcgag ttacgggcgg tattcgggct tcataccg 

<210> 428 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 428 

cgcggatccg tcgactgtgg gggcggcggt ggc 

<210> 429 

<211> 31 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Ol i gonucl eoti de 

<400> 429 

cccgctcgag tcaatcctgc tcttttttgc c 

<210> 430 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 430 

gcggccatat gaagaaaaca ttgacactgc 

<210> 431 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

<400> 431 

gcggcctcga gttaatggtg cgaatgaccg at 

<210> 432 

<211> 46 



278 



EP 1 790 660 A2 



<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 432 

ggggacaagt ttgtacaaaa aagcaggctt gcggcaagga tgccgg 

<210> 433 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 433 

ggggaccact ttgtacaaga aagctgggtc taaagcaaca atgccgg 

<210> 434 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 434 

cgcggatccc atatgaaaca caccgtatcc 

<210> 435 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 435 

cccgctcgag ttatctcgtg cgcgcc 

<210> 436 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 436 

cgcggatccc atatgagccc cgcgccgatt 

<210> 437 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 437 

cccgctcgag tttttgtgcg gtcaggcg 

<210> 438 

<211> 62 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 438 

ggggacaagt ttgtacaaaa aagcaggctt 
tc 

<210> 439 

<211> 30 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> ol i gonucl eoti de 

<400> 439 

cgcggatccc atatggcgga tgcgcccgcg 

<210> 440 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 440 

cccgctcgag aaaccgccaa tccgcc 

<210> 441 

<211> 61 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 441 

ggggaccact ttgtacaaga aagctgggtt 
t 

<210> 442 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 442 

cgcggatccc atatgaaacc caaaccgcac 

<210> 443 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 443 

cccgctcgag tcagcgttgg acgtagt 

<210> 444 

<211> 33 

<212> DNA 

<213> Artificial sequence 

<220> 



gttcgtttgg gggatttaaa ccaaaccaaa 



cattttgttt ttccttcttc tcgaggccat 
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<22 3> Ol i gonucl eoti de 

<400> 444 

gggaattcca tatgaaaaaa atcatcttcg ccg 

<210> 445 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 445 

cccgctcgag ttattgtttg gctgcctcga t 

<210> 446 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 446 

gggaattcca tatggccacc tacaaagtgg acg 

<210> 447 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 447 

cggggatcct tgtttggctg cctcgatttg 

<210> 448 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucl eotide 

<400> 448 

cgcggatccc atatgcaaga acaatcgcag aaag 

<210> 449 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 449 

cccgctcgag ttttttcggc aaattggctt 

<210> 450 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 450 
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ggggacaagt ttgtacaaaa aagcaggctg ccgatgccgt tgcgg 

<210> 451 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 451 

ggggaccact ttgtacaaga aagctgggtt cagggtcgtt tgttgcg 

<210> 452 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 452 

cgcggatccc atatgaaaca ctttccatcc 

<210> 453 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 453 

cccgctcgag ttaccactcg taattgac 

<210> 454 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 454 

cgcggatccc atatggccac aagcgacgac 

<210> 455 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 45 5 

cccgctcgag ttaccactcg taattgac 

<210> 456 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eot i de 

<400> 456 

cgcggatccc atatggccac aaacgacg 

<210> 457 
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<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 457 

cccgctcgag acccacgttg taaggttg 

<210> 458 

<211> 32 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Ol i gonucl eoti de 

<400> 458 

cgcggatccc atatggccac aagcgacgac ga 

<210> 459 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 459 

cccgctcgag acccacgttg taaggttg 

<210> 460 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 460 

cgcggatccc atatgatgaa acactttcca tec 

<210> 461 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 461 

cccgctcgag ttaacccacg ttgtaaggt 

<210> 462 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 462 

cgcggatccc atatgatgaa acactttcca tec 

<210> 463 

<211> 29 

<212> DNA 

<213> Artificial sequence 
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<220> 

<22 3> Ol i gonucl eoti de 

<400> 463 

cccgctcgag ttaacccacg ttgtaaggt 

<210> 464 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 464 

cgcggatccc atatggccac aaacgacg 

<210> 465 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 465 

cccgctcgag gtctgacact gttttatcc 



<210> 


466 


<211> 


33 


<212> 


DNA 


<213> 


Arti f i ci al Sequence 


<220> 




<223> 


Oligonucleotide 


<400> 


466 



cgcggatccc atatgatgaa acactttcca tec 

<210> 467 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 467 

cccgctcgag ttatgctttg gcggcaaag 

<210> 468 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 468 

cgcggatccc atatggccac aaacgacgac 

<210> 469 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> ol i gonucl eoti de 
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<400> 469 

cgcggatccc cactcgtaat tgacgcc 

<210> 470 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 470 

cgcggatccc atatggccac aagcgacgac 

<210> 471 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 471 

cgcggatccc cactcgtaat tgacgcc 

<210> 472 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 472 

cgcggatccc atatggccac aaacgacgac 

<210> 473 

<211> 27 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 473 

cgcggatcca cccacgttgt aaggttg 

<210> 474 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 474 

cgcggatccc atatgatgaa acactttcca tec 

<210> 475 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 475 

cgcggatcca cccacgttgt aaggttg 
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<210> 476 

<211> 25 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 476 

cgcggatccg gagggggtgg tgtcg 

<210> 477 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 477 

cccgctcgag ttgcttggcg gcaaggc 

<210> 478 

<211> 25 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 478 

cgcggatccg gcggaggcgg cactt 

<210> 479 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 479 

cccgctcgag gaaccggtag cctacg 

<210> 480 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 480 

cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt 

<210> 481 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eoti de 

<400> 481 

cccgctcgag cgtatcatat ttcacgtgc 

<210> 482 

<211> 25 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<2 2 3> Ol i gonucl eot i de 

<400> 482 

cgcggatccg gagggggtgg tgtcg 

<210> 483 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 483 

cccgctcgag ttattgcttg gcggcaag 

<210> 484 

<211> 25 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 484 

cgcggatccg gcggaggcgg cactt 

<210> 485 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 485 

cccgctcgag tcagaaccgg tagcctac 

<210> 486 

<211> 41 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 486 

cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 

<210> 487 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 487 

cccgctcgag ttacgtatca tatttcacgt gc 

<210> 488 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 488 

cgcggatccc atatggccac aagcgacgac g 

<210> 489 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 489 

cccgctcgag ccactcgtaa ttgacgcc 

<210> 490 

<211> 30 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 490 

cgcggatccc atatggccac aaacgacgac 

<210> 491 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 491 

cccgctcgag tgctttggcg gcaaagtt 

<210> 492 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 492 

cgcggatccc atatggccac aaacgacgac 

<210> 493 

<211> 37 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> ol i gonucl eoti de 

<400> 493 

cccgctcgag tttagcaata ttatctttgt tcgtagc 

<210> 494 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 



288 



EP 1 790 660 A2 



<400> 494 

cgcggatccc atatgaaagc aaaccgtgcc ga 

<210> 495 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 495 

cccgctcgag ccactcgtaa ttgacgcc 

<210> 496 

<211> 61 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 496 

ggggacaagt ttgtacaaaa aagcaggctg cagccacaaa cgacgacgat gttaaaaaag 
c 

<210> 497 

<211> 61 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 497 

ggggaccact ttgtacaaga aagctgggtt taccactcgt aattgacgcc gacatggtag 
9 

<210> 498 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 498 

gcggccatat ggcagcaaaa gacgtacagt t 

<210> 499 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 499 

gcggcctcga gttacatcat gccgcccata cca 

<210> 500 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 500 
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cgcggatccg ctagcttagg cggcggcgga g 

<210> 501 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> 01 i gonucl eoti de 

<400> 501 

cccgctcgag gaaccggtag cctacg 

<210> 502 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 502 

cccctagcta gcacttctgc gcccgactt 

<210> 503 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 503 

cccgctcgag gaaccggtag cctacg 

<210> 504 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 504 

cgcggatccg ctagcttagg cggcggcgga g 

<210> 505 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 505 

cccgctcgag gaaccggtag cctacg 

<210> 506 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 506 

cgcggatccg ctagcacttc tgcgcccgac tt 

<210> 507 
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<211> 


26 








<212> 


DNA 








<213> 


Arti f i ci al Sequence 






5 


<220> 










<223> 


Oligonucleotide 








<400> 


507 








cccgctcgag gaaccggtag cctacg 




26 


10 


<210> 


508 








<211> 


50 








<212> 


DNA 








<213> 


Artificial Sequence 








<220> 








15 


<223> 


Oligonucleotide 








<400> 


508 








cgcggatccg ctagccgaac gaccccaacc 


ttccctacaa aaactttcaa 


50 




<210> 


509 








<211> 


35 






20 


<212> 


DNA 








<213> 


Arti f i ci al Sequence 








<220> 










<223> 


Oligonucleotide 






25 


<400> 


509 








cccgctcgag tcagaaccga cgtgccaagc 


cgttc 


35 




<210> 


510 








<211> 


32 








<212> 


DNA 






30 


<213> 


Arti f i ci al Sequence 








<220> 










<223> 


oligonucleotide 








<400> 


510 






35 


gccgccatat gcccccactg gaagaacgga 


eg 


32 




<210> 


511 








<211> 


35 








<212> 


DNA 








<213> 


Arti f i ci al Sequence 






40 


<220> 










<223> 


Oligonucleotide 








<400> 


511 








gccgcctcga gtaataaacc ttctatgggc 


agcag 


35 


45 


<210> 


512 








<211> 


31 








<212> 


DNA 








<213> 


Arti f i ci al Sequence 








<220> 








50 


<223> 


Oligonucleotide 








<400> 


512 








cgcggatccc atatgtccgt ccacgcatcc 


g 


31 




<210> 


513 






55 


<211> 


31 






<212> 


DNA 








<213> 


Arti f i ci al Sequence 
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<220> 

<223> Ol i gonucl eot i de 

<400> 513 

cccgctcgag tttgaatttg taggtgtatt g 

<210> 514 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 514 

cgcggatccc atatgacccc ttccgcact 

<210> 515 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 515 

cccgctcgag ttatttgaat ttgtaggtgt at 

<210> 516 

<211> 33 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Ol i gonucl eoti de 

<400> 516 

cgcggatccc atatgaaaac caattcagaa gaa 

<210> 517 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 517 

cccgctcgag tccacagaga ttgtttcc 

<210> 518 

<211> 17 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 518 
gatgcccgaa gggcggg 

<210> 519 

<211> 29 

<212> DNA 

<213> Artificial Sequence 

<220> 

<22 3> Ol i gonucl eoti de 
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<400> 519 

gcccaagctt tcagaagaag acttcacgc 

<210> 520 

<211> 36 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 520 

cgcggatccc atatgcaaac ccataaatac gctatt 

<210> 521 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 521 

gcccaagctt gaagaagact tcacgccag 

<210> 522 

<211> 35 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 522 

cgcggatccc atatggtctt tttcgacaat accga 

<210> 523 

<211> 10 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 523 
gcccaagctt 

<210> 524 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 524 

cgcggatccc atatgaataa aactttaaaa aggcgg 

<210> 525 

<211> 29 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eot i de 

<400> 525 

gcccaagctt tcagaagaag acttcacgc 
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<210> 526 

<211> 35 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> ol i gonucl eoti de 

<400> 526 

cgcgaatccc atatgttcga tcttgattct gtcga 

<210> 527 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> oligonucleotide 

<400> 527 

cccgctcgag tcgcacaggc tgttggcg 

<210> 528 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<2 23> Ol i gonucl eoti de 

<400> 528 

cgcgaatccc atatgttggg cggaggcggc ag 

<210> 529 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eotide 

<400> 529 

cccgctcgag tcgcacaggc tgttggcg 

<210> 530 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 530 

cgcgaatccc atatgttggg cggaggcggc ag 

<210> 531 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 531 

cccgctcgag tcgcacaggc tgttggcg 

<210> 532 

<211> 33 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 532 

cgcggatccc atatggcaaa tttggaggtg cgc 

<210> 533 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 533 

cccgctcgag ttcggagcgg ttgaagc 

<210> 534 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 534 

cgcggatccc atatgcaacg tcgtattata accc 

<210> 535 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 535 

cccgctcgag ttattcggag cggttgaag 

<210> 536 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 536 

cgcggatccc atatgggcat caaagtcgcc atcaacggct 

<210> 537 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol igonucl eotide 

<400> 537 

cccgctcgag tttgagcggg cgcacttcaa gtccg 

<210> 538 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 538 

cgcggatccc atatgggcgg cagcgaaaaa aac 

<210> 539 

<211> 28 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 539 

cccgctcgag gttggtgccg actttgat 

<210> 540 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 540 

cgcggatccc atatgggcgg cggaagcgat a 

<210> 541 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 541 

cccgctcgag tttgcccgct ttgagcc 

<210> 542 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 542 

cgcggatccc atatgggcaa atccgaaaat acg 

<210> 543 

<211> 27 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 543 

cccgctcgag catcccgtac tgtttcg 

<210> 544 

<211> 62 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 
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<400> 544 

ggggacaagt ttgtacaaaa aagcaggctc cgacattacc gtgtacaacg gccaacaaag 
aa 

<210> 545 

<211> 61 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 545 

ggggaccact ttgtacaaga aagctgggtc ttatttcata ccggcttgct caagcagccg 

g 

<210> 546 

<211> 61 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 546 

ggggacaagt ttgtacaaaa aagcaggctg atacggtgtt ttcctgtaaa acggacaaca 
a 

<210> 547 

<211> 60 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 547 

ggggaccact ttgtacaaga aagctgggtc taggaaaaat cgtcatcgtt gaaattcgcc 

<210> 548 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 548 

ggggacaagt ttgtacaaaa aagcaggcta tgcaccccat cgaaacc 

<210> 549 

<211> 47 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 549 

ggggaccact ttgtacaaga aagctgggtc tagtcttgca gtgcctc 

<210> 550 

<211> 41 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 
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<400> 550 

cgcggatccc atatgggaaa tttcttatat agaggcatta g 

<210> 551 

<211> 40 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 551 

cccgctcgag gttaatttct atcaactctt tagcaataat 

<210> 552 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> ol i gonucl eoti de 

<400> 552 

cgcggatccc atatggcctg ccaagacgac a 

<210> 553 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oli gonucl eotide 

<400> 553 

cccgctcgag ccgcctcctg ccgaaa 

<210> 554 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 554 

cgcggatccc atatggcaga gatctgtttg ataa 

<210> 555 

<211> 27 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 555 

cccgctcgag cggttttccg cccaatg 

<210> 556 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 556 

cgcggatccc atatgcagcc ggatacggtc 
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<210> 557 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 igonucleotide 

<400> 557 

cccgctcgag aatcacttcc aacacaaaat 

<210> 558 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 558 

cgcggatccc atatgtggtt gctgatgaag ggc 

<210> 559 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 559 

cccgctcgag gactgcttca tcttctgc 

<210> 560 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 560 

cgcggatccc atatggaact gatgactgtt ttgc 

<210> 561 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 561 

cccgctcgag tcagactgct tcatcttct 

<210> 562 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 562 

cgcggatccc atatgagcat taaagtagcg attaacggtt tcggc 

<210> 563 

<211> 40 

<212> DNA 
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<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 563 

cccgctcgag gattttgcct gcgaagtatt ccaaagtgcg 

<210> 564 

<211> 32 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 564 

cgcggatccg ctagccccga tgttaaatcg gc 

<210> 565 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 565 

cggggatcca tcctgctctt ttttgccgg 

<210> 566 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 566 

cgcggatccg gtggtggtgg tcaaagcaag agcatccaaa 

<210> 567 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol igonucleotide 

<400> 567 

cccaagcttt tcgggcggta ttcgggcttc 

<210> 568 

<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 568 

cgcggatccg gtggtggtgg tgccacctac aaagtggac 

<210> 569 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Ol i gonucl eot i de 

<400> 569 

gcccaagctt ttgtttggct gcctcgat 

<210> 570 

<211> 34 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eot i de 

<400> 570 

cgcggatccg gtggtggtgg tacaagcgac gacg 

<210> 571 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 571 

gcccaagctt ccactcgtaa ttgacgcc 

<210> 572 

<211> 41 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 572 

cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt c 

<210> 573 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 573 

cccaagcttc gtatcatatt tcacgtgc 

<210> 574 

<211> 44 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oli gonucl eoti de 

<400> 574 

cccaagcttg gtggtggtgg tggttcagat ttggcaaacg attc 

<210> 575 

<211> 29 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 575 
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cccgctcgag cgtatcatat ttcacgtgc 

<210> 576 

<211> 45 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 576 

cccaagcttg gtggtggtgg tggtcaaagc aagagcatcc aaacc 

<210> 577 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 577 

cccgctcgag cgggcggtat tcgggctt 

<210> 578 

<211> 32 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 578 

cgcggatccg ctagccccga tgttaaatcg gc 

<210> 579 

<211> 29 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> Oligonucleotide 

<400> 579 

cggggatcca tcctgctctt ttttgccgg 

<210> 580 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 580 

cgcggatccg ctagcggaca cacttatttc ggcatc 

<210> 581 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol igonucl eotide 

<400> 581 

cgcggatccc cagcggtagc ctaatttgat 

<210> 582 
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<211> 41 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 582 

cgcggatccg gtggtggtgg ttcagatttg gcaaacgatt 

<210> 583 

<211> 28 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 583 

cccaagcttc gtatcatatt tcacgtgc 

<210> 584 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 584 

gcggcgtcga cggtggcgga ggcactggat cctcag 

<210> 585 

<211> 35 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 585 

ggaggcactg gatcctcaga tttggcaaac gattc 

<210> 586 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 586 

cccgctcgag cgtatcatat ttcacgtgc 

<210> 587 

<211> 33 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 587 

ggaattccat atgtcagatt tggcaaacga ttc 

<210> 588 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 588 

cgcggatccc gtatcatatt tcacgtgc 

<210> 589 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 589 

cggggatccg ggggcggcgg tggcg 

<210> 590 

<211> 30 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 590 

cccaagctta tcctgctctt ttttgccggc 

<210> 591 

<211> 42 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 591 

cgcggatccg gtggtggtgg tcaaagcaag agcatccaaa 

<210> 592 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 592 

cccaagcttc gggcggtatt cgggcttc 

<210> 593 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 593 

ccccaagctt gggggcggcg gtggcg 

<210> 594 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 
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<400> 594 

cccgctcgag atcctgctct tttttgccgg c 

<210> 595 

<211> 45 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 595 

cccaagcttg gtggtggtgg tggtcaaagc aagagcatcc 

<210> 596 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> 01 i gonucl eoti de 

<400> 596 

cccgctcgag cgggcggtat tcgggctt 

<210> 597 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 597 

ggaggcactg gatccgcagc cacaaacgac gacga 

<210> 598 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Oligonucleotide 

<400> 598 

gcggcctcga gggtggcgga ggcactggat ccgcag 

<210> 599 

<211> 28 

<212> DNA 

<213> Artificial sequence 

<220> 

<223> Ol i gonucl eoti de 

<400> 599 

cccgctcgag acccagcttg taaggttg 

<210> 600 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 600 

ggaggcactg gatccgcagc cacaaacgac gacga 
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<210> 601 

<211> 36 

<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Oligonucleotide 

<400> 601 

gcggcctcga gggtggcgga ggcactggat ccgcag 

<210> 602 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 602 

cccgctcgag ccactcgtaa ttgacgcc 

<210> 603 

<211> 38 

<212> DNA 

<213> Artificial Sequence 
<220> 

<2 2 3> Ol i gonucl eot i de 

<400> 603 

gcggcctcga gggatccggc ggaggcggca cttctgcg 

<210> 604 

<211> 26 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> oligonucleotide 

<400> 604 

cccgctcgag gaaccggtag cctacg 

<210> 605 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 605 

ggaggcactg gatcctcaga tttggcaaac gattc 

<210> 606 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<22 3> Ol i gonucl eoti de 

<400> 606 

gcggcgtcga cggtggcgga ggcactggat cctcaga 

<210> 607 

<211> 29 
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<212> DNA 

<213> Artificial Sequence 
<220> 

<223> ol i gonucl eoti de 

<400> 607 

cccgctcgag cgtatcatat ttcacgtgc 

<210> 608 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 608 

gcggcctcga gggatccgga gggggtggtg tcgcc 

<210> 609 

<211> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 609 

cccgctcgag ttgcttggcg gcaag 

<210> 610 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 610 

ggaggcactg gatccgcagc cacaaacgac gacga 

<210> 611 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 611 

gcggcctcga gggtggcgga ggcactggat ccgcag 

<210> 612 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 612 

cccgctcgag acccagcttg taaggttg 

<210> 613 

<211> 3 5 

<212> DNA 

<213> Artificial sequence 
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<220> 

<223> Ol i gonucl eoti de 

<400> 613 

ggaggcactg gatccgcagc cacaaacgac gacga 

<210> 614 

<211> 36 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eotide 

<400> 614 

gcggcctcga gggtggcgga ggcactggat ccgcag 

<210> 615 

<211> 28 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 615 

cccgctcgag ccactcgtaa ttgacgcc 

<210> 616 

<211> 35 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 616 

ggaggcactg gatcctcaga tttggcaaac gattc 

<210> 617 

<211> 37 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 617 

gcggcgtcga cggtggcgga ggcactggat cctcaga 

<210> 618 

<211> 29 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Ol i gonucl eoti de 

<400> 618 

cccgctcgag cgtatcatat ttcacgtgc 

<210> 619 

<211> 488 

<212> PRT 

<213> Artificial Sequence 

<220> 

<223> MC58 
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<400> 619 

Met Phe Lys Arg ser Val lie Ala Met Ala cys lie Phe Ala Leu ser 
1 5 10 15 

Ala Cys Gly Gly Gly Gly Gly Gly Ser Pro Asp Val Lys Ser Ala Asp 
20 25 30 

Thr Leu Ser Lys Pro Ala Ala Pro val Val Ser Glu Lys Glu Thr Glu 
35 40 45 

Ala Lys Glu Asp Ala Pro Gin Ala Gly ser Gin Gly Gin Gly Ala Pro 
50 55 60 

Ser Ala Gin Gly Ser Gin Asp Met Ala Ala val Ser Glu Glu Asn Thr 
65 70 75 80 

Gly Asn Gly Gly Ala val Thr Ala Asp Asn Pro Lys Asn Glu Asp Glu 
85 90 95 

val Ala Gin Asn Asp Met Pro Gin Asn Ala Ala Gly Thr Asp Ser Ser 
100 105 110 

Thr Pro Asn His Thr Pro Asp Pro Asn Met Leu Ala Gly Asn Met Glu 
115 120 125 

Asn Gin Ala Thr Asp Ala Gly Glu Ser Ser Gin Pro Ala Asn Gin Pro 
130 135 140 

Asp Met Ala Asn Ala Ala Asp Gly Met Gin Gly Asp Asp Pro Ser Ala 
145 150 155 160 

Gly Gly Gin Asn Ala Gly Asn Thr Ala Ala Gin Gly Ala Asn Gin Ala 
165 170 175 

Gly Asn Asn Gin Ala Ala Gly Ser Ser Asp Pro lie Pro Ala Ser Asn 
180 185 190 

Pro Ala Pro Ala Asn Gly Gly Ser Asn Phe Gly Arg Val Asp Leu Ala 
195 200 205 

Asn Gly Val Leu lie Asp Gly Pro Ser Gin Asn lie Thr Leu Thr His 
210 215 220 

Cys Lys Gly Asp Ser Cys Ser Gly Asn Asn Phe Leu Asp Glu Glu Val 
225 230 235 240 

Gin Leu Lys ser Glu Phe Glu Lys Leu ser Asp Ala Asp Lys lie ser 
245 250 255 

Asn Tyr Lys Lys Asp Gly Lys Asn Asp Lys Phe Val Gly Leu Val Ala 
260 265 270 

Asp Ser val Gin Met Lys Gly lie Asn Gin Tyr lie lie Phe Tyr Lys 
275 280 285 

Pro Lys Pro Thr Ser Phe Ala Arg Phe Arg Arg ser Ala Arg ser Arg 
290 295 300 

Arg Ser Leu Pro Ala Glu Met Pro Leu lie Pro Val Asn Gin Ala Asp 
305 310 315 320 

Thr Leu lie val Asp Gly Glu Ala Val Ser Leu Thr Gly His Ser Gly 
325 330 335 

Asn lie Phe Ala Pro Glu Gly Asn Tyr Arg Tyr Leu Thr Tyr Gly Ala 
340 345 350 

Glu Lys Leu Pro Gly Gly Ser Tyr Ala Leu Arg val Gin Gly Glu Pro 
355 360 365 
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Ala Lys Gly Glu Met Leu Ala Gly Ala Ala Val Tyr Asn Gly Glu Val 
370 375 380 

Leu His Phe His Thr Glu Asn Gly Arg Pro Tyr Pro Thr Arg Gly Arg 
385 390 ~ 395 ~ 400 

Phe Ala Ala Lys val Asp Phe Gly ser Lys Ser val Asp Gly lie lie 
405 410 415 

Asp Ser Gly Asp Asp Leu His Met Gly Thr Gin Lys Phe Lys Ala Ala 
420 425 430 

lie Asp Gly Asn Gly Phe Lys Gly Thr Trp Thr Glu Asn Gly Ser Gly 
435 440 445 

Asp val Ser Gly Lys Phe Tyr Gly Pro Ala Gly Glu Glu Val Ala Gly 
450 455 460 

Lys Tyr ser Tyr Arg pro Thr Asp Ala Glu Lys Gly Gly Phe Gly val 
465 470 475 480 

Phe Ala Gly Lys Lys Glu Gin Asp 
485 



<210> 
<211> 

<213> 


620 
427 
PRT 

Arti f i ci al Sequence 


















<lZi)> 
<223> 


2996 
























<4UU> 

Met Phe 
1 


620 
Glu 


Arg 


Ser 
5 


val 


He 


Ala 


Met 


Ala 
10 


cys 


He 


Phe 


Ala 


Leu 
15 


Ala Cys 


Gly 


Gly 
20 


Gly 


Gly 


Gly 


Gly 


ser 
25 


Pro 


Asp 


Val 


Lys 


Ser 
30 


Ala 


Thr Leu 


Ser 
35 


Lys 


Pro 


Ala 


Ala 


Pro 
40 


Val 


val 


Ala 


Glu 


Lys 
45 


Glu 


Thr 


val Lys 

50 


Glu 


Asp 


Ala 


Pro 


Gin 
55 


Ala 


Gly 


Ser 


Gin 


Gly 
60 


Gin 


Gly 


Ala 


Ser Thr 
65 


Gin 


Gly 


Ser 


Gin 
70 


Asp 


Met 


Ala 


Ala 


val 
75 


ser 


Ala 


Glu 


Asn 


Gly Asn 


Gly 


Gly 


Ala 
85 


Ala 


Thr 


Thr 


Asp 


Lys 
90 


Pro 


Lys 


Asn 


Glu 


Asp 
95 


Gly Pro 


Gin 


Asn 
100 


Asp 


Met 


Pro 


Gin 


Asn 
105 


ser 


Ala 


Glu 


ser 


Ala 
110 


Asn 


Thr Gly 


Asn 
115 


Asn 


Gin 


Pro 


Ala 


Asp 
120 


Ser 


Ser 


Asp 


Ser 


Ala 
125 


Pro 


Ala 


Asn Pro 
130 


Ala 


Pro 


Ala 


Asn 


Gly 
135 


Gly 


Ser 


Asn 


Phe 


Gly 
140 


Arg 


val 


ASP 


Ala Asn 
145 


Gly 


val 


Leu 


lie 

150 


Asp 


Gly 


Pro 


Ser 


Gin 
155 


Asn 


He 


Thr 


Leu 


His Cys 


Lys 


Gly 


Asp 

165 


Ser 


cys 


Asn 


Gly 


Asp 
170 


Asn 


Leu 


Leu 


Asp 


Glu 
175 


Ala Pro 


ser 


Lys 
180 


ser 


Glu 


Phe 


Glu 


Asn 
185 


Leu 


Asn 


Glu 


Ser 


Glu 
190 


Arg 



80 



160 
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Glu Lys 

Ala Thr 
210 

Lys Asp 
225 

Arg ser 

Gin Ala 

His Ser 

Tyr Gly 
290 

Gly Glu 
305 

Gly Glu 

Arg Gly 

Gly lie 

Lys Ala 
370 

Gly Gly 
385 

Val Ala 
Phe Gly 



Tyr Lys 
195 

Ala Val 

Lys Ser 

Arg Arg 

Asp Thr 
260 

Gly Asn 
275 

Ala Glu 

Pro Ala 

val Leu 

Arg Phe 
340 

lie Asp 
355 

Ala lie 
Gly Asp 
Gly Lys 



val Phe 
420 



Lys Asp 

Gin Ala 

Ala Ser 
230 

ser Leu 
245 

Leu He 

lie Phe 

Lys Leu 

Lys Gly 
310 

His Phe 

325 

Ala Ala 
Ser Gly 
Asp Gly 



Gly Lys 
200 

Asn Gly 
215 

Ser ser 

Pro Ala 

val Asp 

Ala Pro 
280 

Pro Gly 

295 

Glu Met 
His Thr 
Lys val 



Asp Asp 
360 

Asn Gly 
375 



Ser Asp Lys Phe 
Thr Asn 
Ser Ala 



Lys Tyr 
220 



Glu Met 
250 

Gly Glu 
265 

Glu Gly 
Gly Ser 
Leu Ala 



Arg Phe 

235 

pro Leu 



Ala val 
Asn Tyr 



Tyr Ala 
300 

Gly Thr 
315 



Glu Asn 
330 



Gly Arg 
Gly Ser 
Leu His Met Gly 



Asp Phe 
345 



Val Ser 
390 



Gly Arg 
Tyr Arg 
Ala Gly Lys Lys 



Tyr Ser 
405 



Phe Lys 
Phe Tyr 



Pro Thr 
410 

Glu Gin 
425 



Gly Thr 
380 

Gly Pro 
395 

Asp Ala 

Asp 



Thr Asn 
205 

Val lie 

Arg Arg 

lie Pro 

Ser Leu 
270 

Arg Tyr 
285 

Leu Arg 

Ala val 

Pro Tyr 

Lys Ser 
350 

Thr Gin 
365 

Trp Thr 
Ala Gly 
Glu Lys 



Leu val 

lie Tyr 

Ser Ala 
240 

val Asn 
255 

Thr Gly 

Leu Thr 

val Gin 

Tyr Asn 
320 

Pro Thr 

335 

val Asp 
Lys Phe 
Glu Asn 



Glu Glu 
400 

Gly Gly 
415 



Claims 

1 . A method for the heterologous expression of a protein of the invention, in which (a) at least one domain in the protein 
is deleted and, optionally, (b) no fusion partner is used. 

2. The method of claim 1 , in which the protein of the invention is ORF46. 

3. The method of claim 2, in which ORF46 is divided into a first domain (amino acids 1-433) and a second domain 
(amino acids 433-608). 

4. The method of claim 2, in which the protein of the invention is 564. 

5. The method of claim 4, in which protein 564 is divided into domains as shown in Figure 8. 

6. The method of claim 1 in which the protein of the invention is 961 . 

7. The method of claim 6, in which protein 961 is divided into domains as shown in Figure 12. 
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8. The method of claim 1 , in which the protein of the invention is 502 and the domain is amino acids 28 to 1 67 (numbered 
according to the MC58 sequence). 

9. The method of claim 1 , in which the protein of the invention is 287. 

5 

10. A method for the heterologous expression of a protein of the invention, in which (a) a portion of the N-terminal 
domain of the protein is deleted. 

1 1 . The method of claim 9 or claim 1 0, in which protein 287 is divided into domains A B & C shown in Figure 5. 

10 

12. The method of claim 1 1 , in which (i) domain A, (ii) domains A and B, or (iii) domains A and C are deleted. 

13. The method of claim 11, wherein (i) amino acids 1-17, (ii) amino acids 1-25, (iii) amino acids 1-69, or (iv) amino 
acids 1 -1 06, of domain A are deleted. 

15 

14. A method for the heterologous expression of a protein of the invention, in which (a) no fusion partner is used, and 
(b) the protein's native leader peptide (if present) is used. 

15. The method of claim 14, in which the protein of the invention is selected from the group consisting of: 111, 149, 
20 206, 225-1 , 235, 247-1 , 274, 283, 286, 292, 401 , 406, 502-1 , 503, 51 9-1 , 525-1 , 552, 556, 557, 570, 576-1 , 580, 

583, 664, 759, 907, 913, 920-1 , 936-1 , 953, 961 , 983, 989, Orf4, Orf7-1 , Orf9-1 , Orf23, Orf25, Orf37, Orf38, Orf40, 
Orf40.1, Orf40.2, Orf72-1, Orf76-1, Orf85-2, Orf91, Orf97-1, Orf119, Orf143.1, NMB0109, NMB2050, 008, 105, 
117-1, 121-1, 122-1, 128-1, 148, 216, 243, 308, 593, 652, 726, 926, 982, Orf83-1 and Orf143-1. 

25 16. A method for the heterologous expression of a protein of the invention, in which (a) the protein's leader peptide is 
replaced by the leader peptide from a different protein and, optionally, (b) no fusion partner is used. 

17. The method of claim 16, in which the different protein is 961 , ORF4, E.coli OmpA, or E.carotovora PelB, or in which 
the leader peptide is MKKYLFSAA. 

30 

18. The method of claim 1 7, in which the different protein is E.coli OmpA and the protein of the invention is ORF1 . 

19. The method of claim 17, in which the protein of the invention is 911 and the different protein is E.carotovora PelB 
or E.coli OmpA. 

35 

20. The method of claim 17, in which the different protein is ORF4 and the protein of the invention is 287. 

21. A method for the heterologous expression of a protein of the invention, in which (a) the protein's leader peptide is 
deleted and, optionally, (b) no fusion partner is used. 

40 

22. The method of claim 21 , in which the protein of the invention is 91 9. 

23. A method for the heterologous expression of a protein of the invention, in which expression of a protein of the 
invention is carried out at a temperature at which a toxic activity of the protein is not manifested. 

45 

24. The method of claim 23, in which protein 919 is expressed at 30°C. 

25. A method for the heterologous expression of a protein of the invention, in which protein is mutated to reduce or 
eliminate toxic activity. 

50 

26. The method of claim 25, in which the protein of the invention is 907, 919 or 922. 

27. The method of claim 26, in which 907 is mutated at Glu-1 17 (e.g. Glu^Gly). 

55 28. The method of claim 26, in which 919 is mutated at Glu-255 (e.g. Glu^Gly) and/or Glu-323 (e.g. Glu^Gly). 

29. The method of claim 26, in which 922 is mutated at Glu-1 64 (e.g. Glu^Gly), Ser-213 (e.g. Ser^Gly) and/or Asn- 
348 (e.g. Asn^Gly). 
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30. A method for the heterologous expression of a protein of the invention, in which vector pSM214 is used or vector 
pET-24b is used. 

31 . The method of claim 30, in which the protein of the invention is 953 and the vector is pSM21 4. 

32. A method for the heterologous expression of a protein of the invention, in which a protein is expressed or purified 
such that it adopts a particular multimeric form. 

33. The method of claim 32, in which protein 953 is expressed and/or purified in monomeric form. 

34. The method of claim 32, in which protein 961 is expressed and/or purified in tetrameric form. 

35. The method of claim 32, in which protein 287 is expressed and/or purified in dimeric form. 

36. The method of claim 32, in which protein 919 is expressed and/or purified in monomeric form. 

37. A method for the heterologous expression of a protein of the invention, in which the protein is expressed as a 
lipidated protein. 

38. The method of claim 37, in which the protein of the invention is 91 9, 287, ORF4, 406, 576, or ORF25. 

39. A method for the heterologous expression of a protein of the invention, in which (a) the protein's C-terminus region 
is mutated and, optionally, (b) no fusion partner is used. 

40. The method of claim 39, wherein the mutation is a substitution, an insertion, or a deletion 

41 . The method of claim 40, wherein the protein of the invention is 730, ORF29 or ORF46. 

42. A method for the heterologous expression of a protein of the invention, in which the protein's leader peptide is mutated. 

43. The method of claim 42, in which the protein of the invention is 91 9. 

44. A method for the heterologous expression of a protein, in which a poly-glycine stretch within the protein is mutated. 

45. The method of claim 44, wherein the protein is a protein of the invention. 

46. The method of claim 45, wherein the protein of the invention is 287, 741 , 983 or Tbp2. 

47. The method of claim 46, wherein (Gly) 6 is deleted from 287 or 983. 

48. The method of claim 46, wherein (Gly) 4 is deleted from Tbp2 or 741 

49. The method of claim 47 or claim 48, wherein the leader peptide is also deleted. 

50. The method of any preceding claim, in which the heterologous expression is in an E.coli host. 

51. A protein expressed by the method of any preceding claim. 

52. A heterologous protein comprising the N-terminal amino acid sequence MKKYLFSAA. 
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FIGURE 3 
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FIGURE 4 
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FIGURE 5 
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FIGURE 7 
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FIGURE 8 
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FIGURE 10 
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FIGURE 11 A 
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FIGURE 13 
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